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POLYPEPTIDE-POLYMER CONJUGATES HAVING ADDED AND/OR REMOVED ATTACHMENT GROUPS 

FIELD OF THE INVENTION 

The present invention relates to polypeptide-polyraer 
5 conjugates having added and/or removed one or more attachment 
groups for coupling polymeric molecules on the surface of the 3D 
structure of the polypeptide, a method for preparing polypeptide- 
polymer conjugates of the invention, the use of said conjugated 
for reducing the immunogenicity and allergenicity, and 
10 compositions comprising said conjugate. 

BACKGROUND OF THE INVENTION 

The use of polypeptides, including enzymes, in the 
circulatory system to obtain a particular physiological effect is 

15 well-known in the medical arts. Further, within the arts of 
industrial applications, such as laundry washing, textile 
bleaching, person care, contact lens cleaning, food and feed 
preparation enzymes are used as a functional ingredient. One of 
the important differences between pharmaceutical and industrial 

20 application is that for the latter type of applications (i.e. 
industrial applications) the polypeptides (often enzymes) are not 
intended to enter into the circulatory system of the body. 

Certain polypeptides and enzymes have an unsatisfactory 
stability and may under certain circumstances - dependent on the 

25 way of challenge - cause an immune response, typically an IgG 
and/or IgE response. 

It is today generally recognized that the stability of 
polypeptides is improved and the immune response is reduced when 
polypeptides, such as enzymes, are coupled to polymeric molecules. 

30 It is believed that the reduced immune response is a result of the 
shielding of (the) epitope (s) on the surface of the polypeptide 
responsible for the immune response leading to antibody formation 
by the coupled polymeric molecules. 

Techniques for conjugating polymeric molecules to polypeptides 

35 are well-known in the art. 

One of the first suitable commercially techniques was described 
back in the early 1970' ies and disclosed in e.g. US patent no. 
4,179,337. Said patent concerns non- immunogenic polypeptides, such 
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as enzymes and peptide hormones coupled to polyethylene glycol 
(PEG) or polypropylene glycol (PPG). At least 15% of polypeptides 1 
physiological activity is maintained. 

GB patent no. 1,183,257 (Crook et al.) describes chemistry for 
5 conjugation of enzymes to polysaccharides via a triazine ring. 

Further, techniques for maintaining of the enzymatic activity 
of enzyme-polymer conjugates are also known in the art. 

WO 93/15189 (Veronese et al.) concerns a method for maintaining 
the activity in polyethylene glycol-modif ied proteolytic enzymes 
10 by linking the proteolytic enzyme to a macromolecularized 
inhibitor. The conjugates are intended for medical applications. . 

It has been found that the attachment of polymeric molecules to 
a polypeptide often has the effect of reducing the activity of the 
polypeptide by interfering with the interaction between the 
15 polypeptide and its substrate. EP 183 503 (Beecham Group PLC) 
discloses a development of the above concept by providing 
conjugates comprising pharmaceutical ly useful proteins linked to 
at least one water-soluble polymer by means of a reversible 
linking group. 

20 EP 471,125 (Kanebo) discloses skin care products comprising a 
parent protease (Bacillus protease with the trade name Esperase®) 
coupled to polysaccharides through a triazine ring to improve the 
thermal and preservation stability. The coupling technique used is 
also described in the above mentioned GB patent no. 1,183,257 

25 (Crook et al. ) • 

JP 3083908 describes a skin cosmetic material which 
contains a transglutaminase from guinea pig liver modified with 
one or more water-soluble substance such as PEG, starch, 
cellulose etc. The modification is performed by activating the 

3 0 polymeric molecules and coupling them to the enzyme. The 
composition is stated to be mild to the skin. 

However, it is not always possible to readily couple 
polymeric molecules to polypeptides and enzymes. Further, there is 
still a need for polypeptide-polymer conjugates with an even more 

35 reduced immunogenicity and/ or allergenicity. 

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide improved 
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polypeptide-polymer conjugates suitable for industrial and 

pharmaceutical applications. 

The term "improved polypeptide-polymer conjugates" means in the 

context of the present invention conjugates having a reduced 
5 immune response in humans and animals and/or a improved stability. 

As will be described further below the immune response is 

dependent on the way of challenge. 

The present inventors have found that polypeptides, such as 

enzymes, may be made less immunogenic and/or allergenic by adding 
10 and/ or removing one or more attachment groups on the surface of 

the parent polypeptide to be coupled to polymeric molecules. 

When introducing pharmaceutical polypeptide directly into the 

circulatory system (i.e. bloodstream) the potential risk is an 

immunogenic response in the form of mainly IgG, IgA and/or IgM 
15 antibodies. In contrast hereto, industrial polypeptides, such as 

enzymes used as a functional ingredient in e.g. detergents, are 

not intended to enter the circulatory system. The potential risk 

in connection with industrial polypeptides is inhalation causing 

an allergenic response in the form of mainly IgE antibody 
20 formation. 

Therefore, in connection with industrial polypeptides the 
potential risk is respiratory allergenicity caused by inhalation, 
intratracheal and intranasal presentation of polypeptides. 

The main potential risk of pharmaceutical polypeptides is 
25 immunogenicity caused by intradermal ly, intravenously or subcu- 
taneously presentation of the polypeptide. 

It is to be understood that reducing the "immunogenicity" 
and reducing the "respiratory allergenicity" are two very 
different problems based on different routes of exposure and on 
30 two very different immunological mechanisms: 

The term "immunogenicity" used in connection with the 
present invention may be referred to as allergic contact 
dermatitis in a clinical setting and is a cell mediated delayed 
immune response to chemicals that contact and penetrate the skin. 
35 This cell mediated reaction is also termed delayed contact 
hypersensitivity (type IV reaction according to Gell and Combs 
classification of immune mechanisms in tissue damage) . 

The term "allergenicity" or "respiratory allergenicity" is an 



- WO 98/35026 



4 



PCT/DK98/00046 



immediate anaphylactic reaction (type I antibody-mediated reaction 
according to Gell and Combs) following inhalation of e.g. 
polypeptides. 

According to the present invention it is possible to provide 
5 polypeptides with a reduced immune response and/ or improved 
stability, which has a substantially retained residual activity. 

The allergic and the immunogenic response are in one term, at 
least in the context of the present invention called the "immune 
response" . 

10 In the first aspect the invention relates to a polypeptide- 
polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 

15 comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 

20 of the polypeptide in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

The term "parent polypeptide" refers to the polypeptide to be 
modified by coupling to polymeric molecules. The parent 
polypeptide may be a naturally-occurring (or wild-type) 

25 polypeptide or may be a variant thereof prepared by any suitable 
means. For instance, the parent polypeptide may be a variant of a 
naturally-occurring polypeptide which has been modified by 
substitution, deletion or truncation of one or more amino acid 
residues or by addition or insertion of one or more amino acid 

3 0 residues to the amino acid sequence of a naturally-occurring 
polypeptide. 

A "suitable attachment group" means in the context of the 
present invention any amino acid residue group on the surface of 
the polypeptide capable of coupling to the polymeric molecule in 
35 question. 

Preferred attachment groups are amino groups of Lysine 
residues and the N-terminal amino group. Polymeric molecules may 
also be coupled to the carboxylic acid groups (-COOH) of amino 
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acid residues in the polypeptide chain located on the surface. 
Carboxylic acid attachment groups may be the carboxylic acid group 
of Aspartate or Glutamate and the C-terminal COOH-group. 

A "functional site" means any amino acid residues and/or 
5 cofactors which are known to be essential for the performance of 
the polypeptide, such as catalytic activity, e.g. the catalytic 
triad residues, Histidine, Aspartate and Serine in Serine 
proteases, or e.g. the heme group and the distal and proximal 
Histidines in a peroxidase such as the Arthrojnyces ramosus 
10 peroxidase. 

In the second aspect the invention relates to a method for 
preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
15 3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a 

20 suitable attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. 

The invention also relates to the use of a conjugate of the 
25 invention and the method of the invention for reducing the 
immunogenicity of pharmaceuticals and reducing the allergenicity 
of industrial products. 

Finally the invention relates to compositions comprising a 
conjugate of the invention and further ingredients used in 
30 industrial products or pharmaceuticals. 



BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 shows the ant i- lipase serum antibody levels after 5 
weekly immunizations with i) control ii) unmodified lipase 
35 variant, iii) lipase var iant-SPEG . (X: log (serum dilution); Y 
Optical Density (490/620)). 



DETAILED DESCRIPTION OF THE INVENTION 
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It is the object of the present invention to provide improved 
polypeptide-polymer conjugates suitable for industrial and 
pharmaceutical applications* 

Even though polypeptides used for pharmaceutical applications 
5 and industrial application can be quite different the principle of 
the present invention may be tailored to the specific type of 
parent polypeptide (i.e. enzyme, hormone peptides etc.). 

The inventors of the present invention have provided improved 
polypeptide-polymer conjugates with a reduced immune response in 
10 comparison to conjugates prepared from the corresponding parent 
polypeptides. 

The present inventors have found that polypeptides, such as 
enzymes, may be made less immunogenic and/or less allergenic by 
adding one or more attachment groups on the surface of the parent 
15 polypeptide. In addition thereto the inventors have found that a 
higher percentage of maintained residual functional activity may 
be obtained by removing attachment groups at or close to the 
functional site(s). 

In the first aspect the invention relates to an improved 
2 0 polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 
comparison to the number of attachment groups available on the 

25 corresponding parent polypeptide, and/or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 
of the polypeptide in comparison to the number of attachment 

30 groups available on the corresponding parent polypeptide. 

Whether the attachment groups should be added and/ or removed 
depends on the specific parent polypeptide. 

a) Addition of A ttachment groups 
35 There may be a need for further attachment groups on the 
polypeptide if only few attachment groups are available on the 
surface of the parent polypeptide. The addition of one or more 
attachment groups by substituting or inserting one or more amino 
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acid residues on the surface of the parent polypeptide increases 
the number of polymeric molecules which may be attached in 
comparison to the corresponding parent polypeptide. Conjugates 
with an increased number of polymeric molecules attached thereto 
5 are generally seen to have a reduced immune response in comparison 
to the corresponding conjugates having fewer polymeric molecules 
coupled thereto. 

Any available amino acid residues on the surface of the 
polypeptide, preferentially not being at or close to the 
10 functional site(s), such as the active site(s) of enzymes, may in 
principle be subject to substitution and/ or insertion to provide 
additional attachment groups. 

As will be described further below the location of the 
additional coupled polymeric molecules may be of importance for 
15 the reduction of the immune response and the percentage of 
maintained residual functional activity of the polypeptide itself. 

A conjugate of the invention may typically have from 1 to 25 , 
preferentially 1 to 10 or more additional polymeric molecules 
coupled to the surface of the polypeptide in comparison to the 
20 number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 

However, the optimal number of attachment group to be added 
depends (at least partly) on the surface area (i.e. molecular 
weight) of the parent polypeptide to be shielded by the coupled 
25 polymeric molecules, and further off -course also the number of 
already available attachment groups on the parent polypeptide. 

b) Removing Attachment groups 

In the case of enzymes or other polypeptides performing their 

30 function by interaction with a substrate or the like, polymeric 
molecules coupled to the polypeptide might be impeded by the 
interaction between the polypeptide and its substrate or the like, 
if they are coupled at or close to the functional site(s) (i.e. 
active site of enzymes) . This will most probably cause reduced 

35 activity. 

In the case of enzymes having one or more polymeric molecules 
coupled at or close to the active site a substantial loss of 
residual enzymatic activity can be expected. Therefore, according 
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to the invention conjugates may be constructed to maintain a 

higher percentage of residual enzymatic activity in comparison to 

a corresponding conjugates prepared on the basis of the parent 

enzyme in question. This may be done by substituting and/or 
5 deleting attachment groups at or close to the active site, hereby 

increasing the substrate affinity by improving the accessibility 

of the substrate in the catalytic cleft. 

An enzyme-polymer conjugate of the invention may typically have 

from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled 
10 at or close to the active site in comparison to the number of 

polymeric molecules of a conjugate prepared on the basis of the 

corresponding parent polypeptide. 

As will be explained below "at or close to" the functional 

site(s) means that no polymeric molecule (s) should be coupled 
15 within 5 A, preferably 8 A, especially 10 A of the functional 

site(s) . 

Removal of attachment groups at or close to the functional 
site(s) of the polypeptide may advantageously be combined with 
addition of attachment groups in other parts of the surface of the 
20 polypeptide. 

The total number of attachment groups may this way be 
unchanged, increased or decreased. However the location (s) of the 
total number of attachment group (s) is (are) improved assessed by 
the reduction of the immune response and/or percentage of 
25 maintained residual activity. Improved stability may also be 
obtained this way. 

The number of attachment groups 

Generally seen the number of attachment groups should be 
30 balanced to the molecular weight and/or surface area of the 
polypeptide. The more heavy the polypeptide is the more polymeric 
molecules should be coupled to the polypeptide to obtain 
sufficient shielding of the epitope (s) responsible for antibody 
formation. 

35 Therefore, if the parent polypeptide molecule is relatively 
light (e.g. 1 to 35 kDa) it may be advantageous to increase the 
total number of coupled polymeric molecules (outside the 
functional site(s)) to a total between 4 and 20. 
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If the parent polypeptide molecules is heavier, for instance 35 
to 60 kDa, the number of coupled polymeric molecules (outside the 
functional site(s)) may advantageously be increased to 7 to 40, 
and so on. 

5 The ratio between the molecular weight (Mw) of the polypeptide 
in question and the number of coupled polymeric molecules 
considered to be suitable by the inventors is listed below in 
Table 1. 

10 Table 1 



Molecular weight of parent 
polypeptide (Mw) kDa 


Number of polymeric 
molecules coupled to the 
polypeptide 


1 to 35 


4-20 


35 to 60 


7-40 


60 to 80 


10-50 


80 to 100 


15-70 


more than 100 


more than 20 



Reduced immune response vs. maintained resid ual enzymatic activity 
Especially for enzymes, in comparison to many other types of 
polypeptides, there is a conflict between reducing the immune 

15 response and maintaining a substantial residual enzymatic activity 
as the activity of enzymes are connected with interaction between 
a substrate and the active site often present as a cleft in the 
enzyme structure. 

Without being limited to any theory it is believed that the 

20 loss of enzymatic activity of enzyme-polymer conjugates might be a 
consequence of impeded access of the substrate to the active site 
in the form of spatial hindrance of the substrate by especially 
bulky and/ or heavy polymeric molecules to the catalytic cleft. It 
might also, at least partly, be caused by disadvantageous minor 

25 structural changes of the 3D structure of the enzyme due to the 
stress made by the coupling of the polymeric molecules. 

Maintained residual activity 

A polypeptide-polymer conjugates of the invention has a 
3 0 substantially maintained functional activity. 
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10 

A "substantially" maintained functional activity is in the 
context of the present invention defined as an activity which is 
at least between 20% and 30%, preferably between 30% and 40%, more 
preferably between 40% and 60%, better from 60% up to 80%, even 
5 better from 80% up to about 100%, in comparison to the activity of 
the conjugates prepared on the basis of corresponding parent 
polypeptides. 

In the case of polypeptide-polymer conjugates of the 
invention where no polymeric molecules are coupled at or close to 

10 the functional site(s) the residual activity may even be up to 
100% or very close thereto. If attachment group (s) of the parent 
polypeptide is (are) removed from the functional site the activity 
might even be more than 100% in comparison to modified (i.e. 
polymer coupled) parent polypeptide conjugate. 

15 Position of coupled polymeric molecules 

To obtain an optimally reduced immune response (i.e. 
immunogenic and allergenic response) the polymeric molecules 
coupled to the surface of the polypeptide in question should be 
located in a suitable distance from each other. 

20 In a preferred embodiment of the invention the parent 
polypeptide is modified in a manner whereby the polymeric 
molecules are spread broadly over the surface of the polypeptide. 
In the case of the polypeptide in question has enzymatic activity 
it is preferred to have as few as possible, especially none, 

25 polymeric molecules coupled at or close to the area of the active 
site. 

In the present context "spread broadly over the surface of the 
polypeptide" means that the available attachment groups are 
located so that the polymeric molecules shield different parts of 

30 the surface, preferable the whole or close to the whole surface 
area away from the functional site(s), to make sure that 
epitope (s) are shielded and hereby not recognized by the immune 
system or its antibodies. 

The area of antibody-polypeptide interaction typically 

35 covers an area of 500 A 2 , as described by Sheriff et al. 

(1987), Proc. Natl. Acad. Sci. USA 84, p. 8075-8079. 500 A 2 
corresponds to a rectangular box of 25 A x 20 A or a circular 
region of radius 12.6 A. Therefore, to prevent binding of 
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antibodies to the epitope (s) to the polypeptide in question it 
is preferred to have a maximum distance between two attachment 
groups around 10 A. 

Consequently, amino acid residues which are located in excess 
5 of 10 A away from already available attachment groups are 

suitable target residues. If two or more attachment groups on the 
polypeptide are located very close to each other it will in most 
cases result in that only one polymeric molecule will be coupled. 
To ensure a minimal loss of functional activity it is preferred 

10 not to couple polymeric molecules at or close to the functional 
site(s). Said distance depends at least partly on the bulkiness of 
the polymeric molecules to be coupled, as impeded access by the 
bulky polymeric molecules to the functional site is undesired. 
Therefore, the more bulky the polymeric molecules are the longer 

15 should the distance from the functional site to the coupled 
polymeric molecules be. 

To maintain a substantial functional activity of the 
polypeptide in question attachment groups located within 5 A, 
preferred 8 A, especially 10 A from such functional site(s) 

20 should be left uncoupled and may therefore advantageously be 
removed or changed by mutation. Functional residues should 
normally not be mutated/ removed, even though they potentially 
can be the target for coupling polymeric molecules. In said 
case it may thus be advantageous to chose a coupling chemistry 

25 involving different attachment groups. 

Further, to provide a polypeptide having coupled polymeric 
molecules at (a) known epitope (s) recognizable by the immune 
system or close to said epitope (s) specific mutations at such 
sites are also considered advantageous according to the invention. 

30 If the position of the epitope (s) is (are) unknown it is 
advantageous to couple several or many polymeric molecules to the 
polypeptide. 

As also mentioned above it is preferred that said attachment 
groups are spread broadly over the surface. 

35 

The attachment group 

Virtually all ionized groups, such as the amino groups of 
Lysine residues, are located on the surface of the polypeptide 
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molecule (see for instance Thomas E. Creighton, (1993), 
"Proteins", W.H. Freeman and Company, New York) . 

Therefore, the number of readily accessible attachment groups 
(e.g. amino groups) on a modified or parent polypeptide equals 
5 generally seen the number of Lysine residues in the primary 
structure of the polypeptide plus the N-terminus amino group. 

The chemistry of coupling polymeric molecules to amino groups 
are quite simple and well established in the art. Therefore, it is 
preferred to add and/or remove Lysine residues (i.e. attachment 
10 groups) to/ from the parent polypeptide in question to obtain 
improved conjugates with reduced immunogenicity and/ or 
allergenicity and/or improved stability and/or high percentage 
maintained functional activity. 

Polymeric molecules may also be coupled to the carboxylic 
15 groups (-C00H) of amino acid residues on the surface of the 
polypeptide. Therefore, if using carboxylic groups (including the 
C-terminal group) as attachment groups addition and/ or removal of 
Aspartate and Glutamate residues may also be a suitable according 
to the invention. 

20 If using other attachment groups, such as -SH groups, they 
may be added and/ or removed analogously. 

Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide 
normally will be less pronounced. 

25 Preferred substitutions are conservative substitutions. In the 
case of increasing the number of attachment groups the 
substitution may advantageously be performed at a location having 
a distance of 5 A, preferred 8 A, especially 10 A from the 
functional site(s) (active site for enzymes). 

30 An example of a suitable conservative substitution to obtain 

an additional amino attachment group is a Arginine to Lysine 
substitution. Examples of conservative substitutions to obtain 
additional carboxylic attachment groups are Aspargine to 
Aspartate/Glutamate or Glutamine to Aspartate/Glutamate 

35 substitutions. To remove attachment groups a Lysine residue may be 
substituted with a Arginine and so on. 



The parent polypeptide 
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In the context of the present invention the term "polypeptides" 
includes proteins, peptides and/or enzymes for pharmaceutical or 
industrial applications. Typically the polypeptides in question 
have a molecular weight in the range between about 1 to 100 kDa, 
5 often 15 kDa and 100 kDa. 

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as polypep- 
tides, including peptides, such as peptide hormones, proteins 
10 and/or enzymes, being physiologically active when introduced into 
the circulatory system of the body of humans and/or animals. 

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

Examples of "pharmaceutical polypeptides" contemplated 
15 according to the invention include insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, 
pigmentary hormones, somatomedin, erythropoietin, luteinizing 
hormone, chorionic gonadotropin, hypothalmic releasing factors, 
antidiuretic hormones, thyroid stimulating hormone, relaxin, 
20 interferon, thrombopoietin (TPO) and prolactin. 

Industrial polypeptides 

Polypeptides used for industrial applications often have an 
enzymatic activity. Industrial polypeptides {e.g. enzymes) are (in 
2 5 contrast to pharmaceutical polypeptides) not intended to be 
introduced into the circulatory system of the body. 

It is not very like that industrial polypeptides, such as 
enzymes used as ingredients in industrial compositions and/or 
products, such as detergents and personal care products, including 
30 cosmetics, come into direct contact with the circulatory system of 
the body of humans or animals, as such enzymes (or products 
comprising such enzymes) are not injected (or the like) into the 
bloodstream. 

Therefore, in the case of the industrial polypeptide the 
35 potential risk is respiratory allergy (i.e. IgE response) as a 
consequence of inhalation to polypeptides through the respiratory 
passage • 

In the context of the present invention "industrial polypep- 
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tides" are defined as polypeptides, including peptides, proteins 
and/or enzymes, which are not intended to be introduced into the 
circulatory system of the body of humans and/or animals . 

Examples of such polypeptides are polypeptides, especially 
5 enzymes, used in products such as detergents, household article 
products, agrochemicals, personal care products, such as skin care 
products, including cosmetics and toiletries, oral and dermal 
pharmaceuticals, composition use for processing textiles, 
compositions for hard surface cleaning, and compositions used for 
10 manufacturing food and feed etc. 

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic 
activity will often belong to one of the following groups of 

15 enzymes including Oxidoreductases (E.C. 1, "Enzyme Nomenclature, 
(1992), Academic Press, Inc.)/ such as laccase and Superoxide 
dismutase (SOD); Transferases, (E.C. 2), such as transglutaminases 
(TGases) ; Hydrolases (E.C. 3), including proteases, especially 
subtilisins, and lipolytic enzymes; Isomerases (E.C. 5), such as 

2 0 Protein disulfide Isomerases (PDI) . 

Hydrolases 

Proteolytic enzymes 

Contemplated proteolytic enzymes include proteases selected 
25 from the group of Aspartic proteases, such pepsins, Cysteine 

proteases, such as Papain, Serine proteases, such as subtilisins, 

or metallo proteases, such as Neutrase®. 

Specific examples of parent proteases include PD498 (WO 

93/24623 and SEQ ID NO. 2), Savinase® (von der Osten et al., 
30 (1993), Journal of Biotechnology, 28, p. 55+, SEQ ID NO 3), 

Proteinase K (Gunkel et al., (1989), Eur. J. Biochem, 179, p. 185- 

194), Proteinase R (Samal et al, (1990), Mol. Microbiol, 4, p. 

1789-1792), Proteinase T (Samal et al., (1989), Gene, 85, p. 329- 

333), Subtilisin DY (Betzel et al. (1993), Arch. Biophys, 302, no. 
35 2, p. 499-502), Lion Y (JP 04197182-A) , Rennilase® (Available from 

Novo Nordisk A/S) , JA16 (WO 92/17576) , Alcalase® (a natural 

subtilisin Carlberg variant) (von der Osten et al., (1993), 

Journal of Biotechnology, 28, p. 55+). 
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Lipolytic enzymes 

Contemplated lipolytic enzymes include Humicola lanuginosa 

lipases, e.g. the one described in EP 258 068 and EP 305 216 (See 
5 SEQ ID NO 6 below) , Humicola insolens, a Rhizomucor miehei lipase, 

e.g. as described in EP 238 023 , Absidia sp. lipolytic enzymes (WO 

96/13578), a Candida lipase, such as a C. antarctica lipase, e.g. 

the C. antarctica lipase A or B described in EP 214 761, a 

Pseudomonas lipase such as a P. alcaligenes and P. 
10 pseudoalcaligenes lipase, e.g. as described in EP 218 272, a P. 

cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas sp. 

lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B. 

subtilis lipase (Dartois et al., (1993) Biochemica et Biophysica 

acta 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992) 
15 and a B. pumilus lipase (WO 91/16422). Other types of lipolytic 

include cutinases, e.g. derived from Pseudomonas mendocina as 

described in WO 88/09367, or a cutinase derived from Fusarium 

solani pisi (e.g. described in WO 90/09446) . 

20 Oxidoreductases 

Laccases 

Contemplated laccases include Polyporus pinisitus laccase (WO 
96/00290), Myceliophthora laccase (WO 95/33836), Schytalidium 
laccase (WO 95/338337), and Pyricularia oryzae laccase (Available 
25 from Sigma; . 

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO 
91/05858) , Myxococcaceae peroxidase (WO 95/11964) , Coprinus 
30 cinereus (WO 95/10602) and Arthromyces ramosus peroxidase 
(Kunishima et al. (1994), J. Mol. Biol. 235, p. 331-344). 

Transferases 

Transglutaminases 

35 Suitable transferases include any transglutaminases disclosed 
in WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk 
A/S). 
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Isomerases 

Protein Disulfide Isomerase 

Without being limited thereto suitable protein disulfide 
isomerases include PDIs described in WO 95/01425 (Novo Nordisk 
5 A/S) . 

The polymeric molecule 

The polymeric molecules coupled to the polypeptide may be any 
suitable polymeric molecule, including natural and synthetic homo- 

10 polymers, such as polyols (i.e. poly-OH) , polyamines (i.e. poly- 
NH 2 ) and polycarboxyl acids (i.e. poly-COOH) , and further hetero- 
polymers i.e. polymers comprising one or more different coupling 
groups e.g. a hydroxy 1 group and amine groups. 

Examples of suitable polymeric molecules include polymeric 

15 molecules selected from the group comprising polyalkylene oxides 
(PAO) , such as polyalkylene glycols (PAG) , including polyethylene 
glycols (PEG) , methoxypolyethylene glycols (mPEG) and polypropylen 
glycols, PEG-glycidyl ethers (Epox-PEG) , PEG-oxycarbonylimidazole 
(CDI-PEG) , Branced PEGs, poly-vinyl alcohol (PVA) , poly- 

20 carboxylates, poly- (vinylpyrolidone) , poly-D,L-amino acids, 
polyethylene-co-maleic acid anhydride, polystyrene-co-malic acid 
anhydrid, dextrans including carboxymethyl-dextrans, heparin, 
homologous albumin, celluloses, including methylcellulose, 
carboxymethylcellulose, ethylcellulose, hydroxyethylcellulose 

25 carboxyethylcellulose and hydroxypropylcellulose, hydrolysates of 
chitosan, starches such as hydroxyethyl-straches and hydroxy 
propyl-starches , glycogen, agaroses and derivates thereof, guar 
gum, pullulan, inulin, xanthan gum, carrageenin, pectin, alginic 
acid hydrolysates and bio-polymers. 

3 0 Preferred polymeric molecules are non-toxic polymeric molecules 
such as (m) polyethylene glycol ((ra)PEG) which further requires a 
relatively simple chemistry for its covalently coupling to 
attachment groups on the enzyme's surface. 

Generally seen polyalkylene oxides (PAO) , such as polyethylene 

35 oxides, such as PEG and especially mPEG, are the preferred 
polymeric molecules, as these polymeric molecules, in comparison 
to polysaccharides such as dextran, pullulan and the like, have 
few reactive groups capable of cross-linking. 
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Even though all of the above mentioned polymeric molecules may 
be used according to the invention the methoxypolyethylene glycols 
(mPEG) may advantageously be used. This arise from the fact that 
methoxyethylene glycols have only one reactive end capable of 
5 conjugating with the enzyme. Consequently, the risk of cross- 
linking is less pronounced. Further, it makes the product more 
homogeneous and the reaction of the polymeric molecules with the 
enzyme easier to control. 

10 Preparation of enzyme variants 

Enzyme variants to be conjugated may be constructed by any 
suitable method. A number of methods are well established in 
the art. For instance enzyme variants according to the 
invention may be generated using the same materials and methods 

15 described in e.g. WO 89/06279 (Novo Nordisk A/S) , EP 130,756 
(Genentech) , EP 479,870 (Novo Nordisk A/S), EP 214,435 
(Henkel) , WO 87/04461 (Amgen) , WO 87/05050 (Genex) , EP appli- 
cation no. 87303761 (Genentech), EP 260,105 (Genencor) , WO 
88/06624 (Gist-Brocades NV) , WO 88/07578 (Genentech) , WO 

20 88/08028 (Genex), WO 88/08033 (Amgen), WO 88/08164 (Genex), 
Thomas et al. (1985) Nature, 318 375-376; Thomas et al. (1987) 
J. Mol. Biol., 193 , 803-813; Russel and Fersht (1987) Nature 
328 496-500. 

25 Generation of site directed mutations 

Prior to mutagenesis the gene encoding the polypeptide of 
interest must be cloned in a suitable vector. Methods for 
generating mutations in specific sites is described below. 

Once the polypeptide encoding gene has been cloned, and 

30 desirable sites for mutation identified and the residue to 
substitute for the original ones have been decided, these 
mutations can be introduced using synthetic oligonucleotides. 
These oligonucleotides contain nucleotide sequences flanking the 
desired mutation sites; mutant nucleotides are inserted during 

35 oligo-nucleotide synthesis. In a preferred method, Site-directed 
mutagenesis is carried out by SOE-PCR mutagenesis technique 
described by Kammann et al. (1989) Nucleic Acids Research 17(13), 
5404, and by Sarkar G. and Sommer, S.S. (1990); Biotechniques 8, 
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404-407. 

Activation of polymers 

If the polymeric molecules to be conjugated with the 
5 polypeptide in question are not active it must be activated by the 
use of a suitable technique. It is also contemplated according to 
the invention to couple the polymeric molecules to the polypeptide 
through a linker. Suitable linkers are well-known to the skilled 
person. 

10 Methods and chemistry for activation of polymeric molecules 
as well as for conjugation of polypeptides are intensively 
described in the literature. Commonly used methods for activation 
of insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glu tar aldehyde, biepoxides, 

15 epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 
immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Cross linking", CRC Press, Boca Raton; G.T. Hermanson et al., 

20 (1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 
polymers but are also applicable to activation of soluble polymers 
e.g. periodate , trichlorotriazine , sulf onylhalides , 

divinylsulfone, carbodiimide etc. The functional groups being 

25 amino, hydroxy 1, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 
considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

30 In the following a number of suitable polymer activation 

methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of poly- 
peptides may be performed with the aid of diimide and for example 

35 amino-PEG or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. 
Soc, 98, 289-291) or diazoacetate/ amide (Wong et al., (1992), 
"Chemistry of Protein Conjugation and Crossl inking" , CRC Press). 
Coupling polymeric molecules to hydroxy groups are generally 
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very difficult as it must be performed in water . Usually 
hydrolysis predominates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
5 disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 
selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

10 Techniques involving coupling electrophilically activated 
PEGs to the amino groups of Lysines may also be useful. Many of 
the usual leaving groups for alcohols give rise to an amine 
linkage. For instance, alkyl sulfonates, such as tresylates 
(Nilsson et al., (1984), Methods in Enzymology vol. 104, Jacoby, 

15 W. B., Ed., Academic Press: Orlando, p. 56-66; Nilsson et al. , 
(1987), Methods in Enzymology vol. 135; Mosbach, K. , Ed.; Academic 
Press: Orlando, pp. 65-79; Scouten et al., (1987), Methods in 
Enzymology vol. 135, Mosbach, K. , Ed., Academic Press: Orlando, 
1987; pp 79-84; Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 

20 93, pp. 4217-4219), mesylates (Harris, (1985), supra; Harris et 
al., (1984), J. Polym. Sci. Polym. Chem. Ed. 22, pp 341-352), aryl 
sulfonates like tosylates, and para-nitrobenzene sulfonates can be 
used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
25 converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with nucleo- 
philes like amino groups in polypeptides allow stable linkages to 
be formed between polymer and polypeptide. In addition to high 
conjugation yields, the reaction conditions are in general mild 
30 (neutral or slightly alkaline pH, to avoid denaturation and little 
or no disruption of activity) , and satisfy the non-destructive re- 
quirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more un- 
stable decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 
35 (1995), Bioconjugate Chem., 6, 150-165). Epoxides may also been 
used for creating amine bonds but are much less reactive than the 
above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise 
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to carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995), Biocon- 
5 jugate Chem. , 6, 62-69, with imidazole (Allen et al., (1991), 
Carbohydr. Res., 213, pp 309-319), with para-nitrophenol, DMAP (EP 
632 082 Al, (1993), Looze, Y.) etc. The derivatives are usually 
made by reacting the chloroformate with the desired leaving group. 
All these groups give rise to carbamate linkages to the peptide. 

10 Furthermore, isocyanates and isothiocyanates may be employed 

yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these com- 

15 pounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 
(1992), Zalipsky). This group may be activated with N-hydroxy suc- 

20 cinimide. 

Furthermore, a special linker can be introduced. The oldest 
being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
25 375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
30 Greenwald, R. B.) thus introducing an additional amide linkage. 

As some peptides do not comprise many Lysines it may be 
advantageous to attach more than one PEG to the same Lysine. This 
can be done e.g. by the use of 1, 3-diamino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
35 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

The coupling technique used in the examples is the N- 
succinimidyl carbonate conjugation technique descried in WO 
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90/13590 (Enzon). 

Method for preparing improved conjugates 

It is also an object of the invention to provide a method for 
5 preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
10 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
15 selected in step b) at or close to the functional site(s) , 

d) coupling polymeric molecules to the mutated polypeptide. 

Step a) Identifying amino acid residues located on the surface of 
the parent polypeptide 

20 

3-dimensional structure HP-structure) 

To perform the method of the invention a 3-dimensional 

structure of the parent polypeptide in question is required. 

This structure may for example be an X-ray structure, an NMR 
25 structure or a model-built structure. The Brookhaven Databank 

is a source of X-ray- and NMR-structures . 

A model-built structure may be produced by the person 

skilled in the art if one or more 3D-structure(s) exist (s) of 

homologous polypeptide (s) sharing at least 30% sequence 
30 identity with the polypeptide in question. Several software 

packages exist which may be employed to construct a model 

structure. One example is the Homology 95.0 package from 

Biosym. 

Typical actions required for the construction of a model 
35 structure are: alignment of homologous sequences for which 3D- 
structures exist, definition of Structurally Conserved Regions 
(SCRs), assignment of coordinates to SCRs, search for 
structural fragments /loops in structure databases to replace 
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Variable Regions, assignment of coordinates to these regions, 
and structural refinement by energy minimization. Regions 
containing large inserts (£3 residues) relative to the known 
3D-structures are known to be quite difficult to model, and 
5 structural predictions must be considered with care. 

Having obtained the 3D-structure of the polypeptide in 
question, or a model of the structure based on homology to 
known structures, this structure serves as an essential 
prerequisite for the fulfillment of the method described below. 

10 

Step b) Selection of target amino acid residues for mutation 
Target amino acid residues to be mutated are according to 
the invention selected in order to obtain additional or fewer 
attachment groups, such as free amino groups (-NH 2 ) or free 
15 carboxylic acid groups (-COOH) , on the surface of the 

polypeptide and/ or to obtain a more complete and broadly spread 
shielding of the epitope (s) on the surface of the polypeptide. 

Conservative substitution 
20 It is preferred to make conservative substitutions in the 

polypeptide, as conservative substitutions secure that the 

impact of the mutation on the polypeptide structure is limited. 
In the case of providing additional amino groups this may be 

done by substitution of Arginine to Lysine, both residues being 
25 positively charged, but only the Lysine having a free amino 

group suitable as an attachment groups. 

In the case of providing additional carboxylic acid groups 

the conservative substitution may for instance be an Aspargine 

to Aspartic acid or Glutamine to Glutamic acid substitution. 
30 These residues resemble each other in size and shape, except 

from the carboxylic groups being present on the acidic 

residues. 

In the case of providing fewer attachment groups, e.g. at or 
close to the active site, a Lysine may be substituted with a 
35 Arginine, and so on. 

Which amino acids to substitute depends in principle on the 
coupling chemistry to be applied. 
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Non-conservative substitution 

The mutation may also be on target amino acid residues which 
are less/non-conservative. Such mutation is suitable for 
obtaining a more complete and broadly spread shielding of the 
5 polypeptide surface than can be obtained by the conservative 
substitutions . 

The method of the invention is first described in general 
terms, and subsequently using specific examples. 

Note the use of the following terms: 
10 Attachmentjresidue: residue (s) which can bind polymeric 
molecules, e.g. Lysines (amino group) or Aspartic/Glutamic 
acids (carboxylic groups) . N- or C-terminal amino/carboxylic 
groups are to be included where relevant. 

Mutation_residue: residue (s) which is to be mutated, e.g. 
15 Arginine or Aspargine/Glutamine. 

Essential_catalytic_residues: residues which are known to be 
essential for catalytic function, e.g. the catalytic triad in 
Serine proteases. 

Solvent_exposed_residues: These are defined as residues which 
20 are at least 5% exposed according to the BIOSYM/INSIGHT 

algorithm found in the module Homology 95.0. The sequence of 
commands are as follows; 

Homology=>ProStat=>Access_Surf=>SolvJRadius 1.4; Heavy atoms 
only; Radii source VdW; Output: Fractional Area; Polarity 
25 source: Default. The file f ilename_area. tab is produced. Note: 
For this program to function properly all water molecules must 
first be removed from the structure. 
It looks for example like: 
# PD4 9 8 FI NALMODEL 



30 


# residue 


area 




TRP_1 


136.275711 




SER_2 


88. 188095 




PRO_3 


15.458788 




ASN_4 


95.322319 


35 


ASP_5 


4.903404 




PRO_6 


68.096909 




TYR_7 


93.333252 




TYR 8 


31.791576 
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SER_9 95.983139 
, . continued 

1. Identification of residues which are more than 10 A away 

5 from the closest attachment_residue, and which are located at 
least 8 A away from essential_catalytic_residues. This residue 
subset is called REST, and is the primary region for 
conservative mutation_residue to attachmentjresidue 
substitutions . 

10 

2. Identification of residues which are located in a 0-5 A 
shell around subset REST, but at least 8 A away from 
essential_catalytic_residues. This residue subset is called 
SUB5B. This is a secondary region for conservative 

15 mutation_residue to attachmentjresidue substitutions, as a 

ligand bound to an attachmentjresidue in SUB5B will extend into 
the REST region and potentially prevent epitope recognition. 

3. Identification of solvent_exposed mutationjresidues in REST 
20 and SUB5B as potential mutation sites for introduction of 

attachment jresidues . 

4. Use BIOSYM/INSIGHT's Biopolymer module and replace residues 
identified under action 3. 

25 

5. Repeat 1-2 above producing the subset RESTx . This subset 
includes residues which are more than 10 A away from the 
nearest attachmentjresidue, and which are located at least 8 A 
away from essential catalytic residues. 

30 

6. Identify solvent_exposed_residues in RESTx. These are 
potential sites for less/non-conservative mutations to 
introduce atttachment_ residues. 

35 

Step c) Substituting, inserting or deleting amino acid residues 

The mutation (s) performed in step c) may be performed by 
standard techniques well known in the art, such as site-directed 
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mutagenesis (see, e.g., Sambrook et al. (1989), Sambrook et al., 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. 

A general description of nucleotide substitution can be found 
in e.g. Ford et al., 1991, Protein Expression and Purification 2, 
5 p. 95-107. 

Step d) Coupling polymer ic molecules to the modified parent enzyme 
Polypeptide-polymer conjugates of the invention may be 
prepared by any coupling method known in the art including the 
10 above mentioned techniques. 

Coupling of polymeric molecules to the polypeptide in question 

If the polymeric molecules to be conjugated with the 
polypeptide are not active it must be activated by the use of a 

15 suitable method. The polymeric molecules may be coupled to the 
polypeptide through a linker. Suitable linkers are well known to 
the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described 

20 in the literature. Commonly used methods for activation of 
insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glu tar aldehyde, biepoxides, 
epichlorohydrin, divinylsulf one, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 

25 immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al., 
(1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 

30 polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulfonylhalides, 

divinylsulfone, carbodiimide etc. The functional groups being 
amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 

35 considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation 
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methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of enzymes 
can be performed with the aid of diimide and for example amino-PEG 
5 or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. Soc. , 98, 
289-291) or diazoacetate/ amide (Wong et al., (1992), "Chemistry of 
Protein Conjugation and Crosslinking" , CRC Press). 

Coupling polymeric molecules to hydroxy groups are generally 
very difficult as it must be performed in water. Usually 
10 hydrolysis predominates over reaction with hydroxy 1 groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 
15 selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

Techniques involving coupling electrophilically activated PEGs 
to the amino groups of Lysines are also be useful. Many of the 
20 usual leaving groups for alcohols give rise to an amine linkage. 
For instance, alkyl sulfonates, such as tresylates (Nilsson et 
al., (1984), Methods in Enzymology vol. 104, Jacoby, W. B., Ed., 
Academic Press: Orlando, p. 56-66; Nilsson et al., (1987), Methods 
in Enzymology vol. 135; Mosbach, K. , Ed.; Academic Press: Orlando, 
25 pp. 65-79; Scouten et al., (1987), Methods in Enzymology vol. 135, 
Mosbach, K. , Ed., Academic Press: Orlando, 1987; pp 79-84; 
Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 93, pp. 4217-4- 
219), mesylates (Harris, (1985), supra : Harris et al., (1984), J. 
Polym. Sci. Polym. Chem. Ed. 22, pp. 341-352), aryl sulfonates 
30 like tosylates, and para-nitrobenzene sulfonates can be used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with 
nucleophiles like amino groups in polypeptides allow stable 
35 linkages to be formed between polymer and polypeptide. In addition 
to high conjugation yields, the reaction conditions are in general 
mild (neutral or slightly alkaline pH, to avoid denaturation and 
little or no disruption of activity) , and satisfy the non- 
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destructive requirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more 
unstable decomposing into PEG, dioxane, and sulfonic acid 
(Zalipsky, (1995), Bioconjugate Chem. , 6, 150-165). Epoxides may 
5 also been used for creating amine bonds but are much less reactive 
than the above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise to 
carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
10 patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995), 
Bioconjugate Chem., 6, 62-69, with imidazole (Allen et al., 

(1991) , Carbohydr. Res., 213, pp 309-319), with para -nitrophenol, 
DMAP (EP 632 082 Al, (1993), Looze, Y.) etc. The derivatives are 

15 usually made by reacting the chloroformate with the desired 
leaving group. All these groups give rise to carbamate linkages to 
the peptide. 

Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 
20 Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5 , 349 , 001 , ( 1994 ) , Greenwald et al . ) . The reactivity of these 
compounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
25 also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 

(1992) , Zalipsky). This group may be activated with N-hydroxy 
succinimide • 

Furthermore, a special linker can be introduced. The oldest 
30 being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986) , J. Polyra. Sci. Polym. Chem. Ed., 24, 
375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
35 yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
Greenwald, R. B.) thus introducing an additional amide linkage. 
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As some peptides do not comprise many Lysines it may be advan- 
tageous to attach more than one PEG to the same Lysine. This can 
be done e.g. by the use of 1, 3-diamino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
5 with carbamate linkages (WO 95/11924, Greenwald et al.)- Lysine 
residues may also be used as the backbone. 

Addition of attachment groups 

Specific examples of PD498 variant-SPEG conjugates 
10 A specific example of a protease is the parent PD498 (WO 
93/24623 and SEQ ID NO. 2). The parent PD498 has a molecular 
weight of 29 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from the 


Arginine 


Lysine 


active site 






0-5 A 


1 




5-10 A 






10-15 A 


5 


6 


15-20 A 


2 


3 


20-25 A 


1 


3 


total 


9 


12 



15 The inventors examined which parent PD498 sites on the surface 
may be suitable for introducing additional attachment groups. 

A. Suitable conservative Arginine to Lysine substitutions in 
parent PD498 may be any of R51K, R62K f R121K, R169K, R250K, R28K, 
R190K. 

20 B. Suitable non-conservative substitutions in parent PD498 may 
be any of P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, 
N65K, G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, 
S220K, Y221K, G262K. 

As there is no Lysine residues at or close to the active site 

25 there is no need for removing any attachment group. 

PD498 variant-SPEG conjugates may be prepared using any of the 
above mentioned PD498 variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 

30 described below. 
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Removal of attachment groups 

Specific examples of BPN~ variant-SPEG conjugates 

A specific example of a protease having an attachment group in 
5 the active site is BPN' which has 11 attachment groups (plus an N- 
terminal amino group) : BPN' has a molecular weight of 28 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from 


Arginine 


Lysine 


the active site 






0-5 A 




1 


5-10 A 






10-15 A 


1 


4 


15-20 A 


1 


4 j 


20-25 A 




2 


total 


2 


11 



10 The Lysine residue located within 0-5 A of the active site can 

according to the invention advantageously be removed. Specifically 

this may be done by a K94R substitution. 

BPN' variant-SPEG conjugates may be prepared using the above 

mentioned BPN" variant as the starting material by any conjugation 
15 technique known in the art for coupling polymeric molecules to 

amino groups on the enzyme. 

Addition and removal of attachment groups 

Specific example of Savinase®-SPEG conj ugates 
20 As described in Example 2 parent Savinase® (von der Osten et 

al., (1993), Journal of Biotechnology, 28, p. 55+ and SEQ ID NO. 

3) may according to the invention have added a number of amino 

attachment groups to the surface and removed an amino attachment 

group close to the active site. 
25 Any of the following substitutions in the parent Savinase® 

are sites for mutagenesis: R10K, R19K, R45K, R145K, R170K, 

R186K and R247K. 

The substitution K94R are identified as a mutation suitable 

for preventing attachment of polymers close to active site. 
30 Savinase® variant-SPEG conjugates may be prepared using any of 
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the above mentioned Savinase® variants as the starting material by 
any conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. 

5 Addition of attachment groups 

A specific examples of Humicola lanuginosa lipase variants-SPEG 
conjugates 

Specific examples of lipase variants with reduced 
immunogenicity using the parent Huminocal lanuginosa DSM 4109 
10 lipase (see SEQ ID No 6) as the backbone for substitutions are 
listed below. 

The parent unmodified Humicola lanuginosa lipase has 8 
attachment groups including the N-terminal NH 2 group and a 
molecular weight of about 29 kDa. 
15 A. Suitable conservative Arginine to Lysine substitutions in the 
parent lipase may be any of R133K, R139K, R160K, R179K, R209K, 
R118K and R125K. 

Suitable non-conservative substitutions in the parent lipase 
may be any of: 

20 A18K f G31K / T32K,N33K f G38K f A40K / D48K r T50K / E56K,D57K f S58K,G59K, 
V60K,G61K / D62K / T64K,L78K / N88K,G91K r N92K,L93K / S105K / G106K / 
V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 

Further suitable non-conservative substitution in the Humicola 
lanuginosa lipase include: E87K or D254K. 

25 Lipase variant-SPEG conjugates may be prepared using any of the 
above mentioned lipase variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 
described below. 

30 In Example 12 below is it shown that a conjugate of the 
Humicola lanuginosa lipase variant with a E87K+D254K substitutions 
coupled to S-PEG 15,000 has reduced immunogenic response in Balb/C 
mice in comparison to the corresponding parent unmodified enzyme. 

35 Immunogenicity and Alleraenicitv 

"Immunogenicity" is a wider term than "antigenicity" and 
"allergenicity", and expresses the immune system's response to the 
presence of foreign substances. Said foreign substances are called 
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immunogens, antigens and allergens depending of the type of immune 
response the elicit. 

An "immunogen" may be defined as a substance which, when intro- 
duced into circulatory system of animals and humans, is capable of 
5 stimulating an immunologic response resulting in formation of 
immunoglobulin. 

The term "antigen" refers to substances which by themselves are 
capable of generating antibodies when recognized as a non-self 
molecule. 

10 Further, an "allergen" may be defined as an antigen which may 
give rise to allergic sensitization or an allergic response by IgE 
antibodies (in humans, and molecules with comparable effects in 
animals) . 

15 Assessment of immunoaencitv 

Assessment of the immunogenic ity may be made by injecting 
animal subcutaneous ly to enter the immunogen into the circulation 
system and comparing the response with the response of the 
corresponding parent polypeptide. 

20 The "circulatory system" of the body of humans and animals 
means, in the context of the present invention, the system which 
mainly consists of the heart and blood vessels. The heart delivers 
the necessary energy for maintaining blood circulation in the 
vascular system. The circulation system functions as the 

25 organism's transportation system, when the blood transports 02/ 
nutritious matter, hormones, and other substances of importance 
for the cell regulation into the tissue. Further the blood removes 
C0 2 from the tissue to the lungs and residual substances to e.g. 
the kidneys. Furthermore, the blood is of importance for the 

3 0 temperature regulation and the defence mechanisms of the body, 
which include the immune system. 

A number of in vitro animal models exist for assessment of the 
immunogenic potential of polypeptides. Some of these models give a 
suitable basis for hazard assessment in man. Suitable models 

35 include a mice model. 

This model seek to identify the immunogenic response in the 
form of the IgG response in Balb/C mice being injected 
subcutaneously with modified and unmodified polypeptides. 
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Also other animal models can be used for assessment of the 
immunogenic potential. 

A polypeptide having "reduced immunogenicity" according to the 
invention indicates that the amount of produced antibodies, e.g. 
5 immunoglobulin in humans, and molecules with comparable effects in 
specific animals, which can lead to an immune response, is 
significantly decreased, when introduced into the circulatory 
system, in comparison to the corresponding parent polypeptide. 

For Balb/C mice the IgG response gives a good indication of the 
10 immunigenic potential of polypeptides. 

Assessment of alleraenicitv 

Assessment of allergenicity may be made by inhalation tests, 
comparing the effect of intratracheal ly (into the trachea) 

15 administrated parent enzymes with the corresponding modified 
enzymes according to the invention. 

A number of in vivo animal models exist for assessment of the 
allegenicity of enzymes. Some of these models give a suitable 
basis for hazard assessment in man. Suitable models include a 

20 guinea pig model and a mouse model. These models seek to identify 
respiratory allergens as a function of elicitation reactions 
induced in previously sensitised animals. According to these 
models the alleged allergens are introduced intratracheal ly into 
the animals. 

25 A suitable strain of guinea pigs, the Dunkin Hartley strain, do 
not as humans, produce IgE antibodies in connection with the 
allergic response. However, they produce another type of antibody 
the IgGIA and IgGIB (see e.gr. Prento, ATLA, 19, p. 8-14, 1991), 
which are responsible for their allergenic response to inhaled 

30 polypeptides including enzymes. Therefore, when using the Dunkin 
Hartley animal model, the relative amount of IgGIA and IgGIB is a 
measure of the allergenicity level. 

The Balb/C mice strain is suitable for intratracheal exposure. 
Balb/C mice produce IgE as the allergic response. 

35 More details on assessing respiratory allergens in guinea pigs 
and mice is described by Kimber et al.,(1996), Fundamental and 
Applied Toxicology, 33, p. 1-10. 
Other animals such as rats, rabbits etc. may also be used for 
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comparable studies. 
Composition 

The invention relates to a composition comprising a 
5 polypeptide-polymer conjugate of the invention. 

The composition may be a pharmaceutical or industrial 
composition. 

The composition may further comprise other polypeptides, 
proteins or enzymes and/or ingredients normally used in e.g. 

10 detergents, including soap bars, household articles, 
agrochemicals, personal care products, including skin care 
compositions, cleaning compositions for e.g. contact lenses, oral 
and dermal pharmaceuticals, composition use for treating textiles, 
compositions used for manufacturing food, e.g. baking, and feed 

15 etc. 

Use of the polypeptide-polvmer conjugate 

The invention also relates to the use of the method of the 
invention for reducing the immune response of polypeptides. 
20 It is also an object of the invention to use the polypeptide- 

polymer conjugate of the invention to reduce the allergenicity of 
industrial products, such as detergents, such as laundry, disk 
wash and hard surface cleaning detergents, and food or feed 
products . 

25 

MATERIAL AND METHODS 
Materials 

Enzymes: 

PD498: Protease of subtilisin type shown in WO 93/24623. The 
30 sequence of PD498 is shown in SEQ ID NO. 1 and 2. 
Savinase® (Available from Novo Nordisk A/S) 

Humicola lanuginosa lipase: Available from Novo Nordisk as 
lipolase® and is further described in EP 305,216. The DNA and 
protein sequence is shown in SEQ ID NO 5 and 6, respectively. 
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Strains: 

B . subtilis 309 and 147 are variants of Bacillus lentus , 
deposited with the NCIB and accorded the accession numbers NCIB 
5 10309 and 10147, and described in US Patent No. 3,723,250 
incorporated by reference herein. 

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J . 
Mol. Biol. 138 179-207), was made r~,m + by conventional methods 
and is also described in US Patent Application Serial No. 
10 039,298. 

Vectors ; 

pPD498: E. coli - B . subtilis shuttle vector (described in 
US patent No. 5,621,089 under section 6.2.1.6) containing the 
15 wild-type gene encoding for PD498 protease (SEQ ID NO. 2) . The 
same vector is use for mutagenesis in E. coli as well as for 
expression in B. subtilis. 

General molecular biology methods; 

2 0 Unless otherwise mentioned the DNA manipulations and 
transformations were performed using standard methods of 
molecular biology (Sambrook et al. (1989) Molecular cloning: A 
laboratory manual, Cold Spring Harbor lab. , Cold Spring Harbor, 
NY; Ausubel, F. M. et al. (eds.) "Current protocols in 

25 Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R. , 
and Cutting, S. M. (eds.) "Molecular Biological Methods for 
Bacillus". John Wiley and Sons, 1990). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

30 

Materials, chemicals and solutions: 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, # 
031; dilution 1:1000). 
35 Mouse anti-rat IgE (Serotec MCA193 ; dilution 1:200). 
Rat anti-mouse IgE (Serotec MCA419; dilution 1:100). 
Biot in-labeled mouse anti-rat IgGl monoclonal antibody (Zymed 03- 
9140; dilution 1:1000) 
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Biotin-labeled rat anti-mouse IgGl monoclonal antibody (Serotec 
MCA336B; dilution 1:1000) 

Streptavidin-horse radish peroxidase (KirkegSrd & Perry 14-30-00; 
dilution 1: 1000) . 
5 CovaLink NH 2 plates (Nunc, Cat# 459439) 
• Cyanuric chloride (Aldrich) 
Acetone (Merck) 

Rat anti-Mouse IgGl, biotin (SeroTec, Cat# MCA336B) 
Streptavidin, peroxidase (KPL) 
10 Ortho-Phenylene-diamine (OPD) (Kem-en-Tec) 
H 2 0 2 , 30% (Merck) 
Tween 20 (Merck) 
Skim Milk powder (Difco) 
H 2 S0 4 (Merck) 

15 

Buffers and Solutions: 

Carbonate buffer (0.1 M, pH 10 (1 liter)) Na 2 C0 3 10.60 g 

PBS (pH 7.2 (1 liter)) NaCl 8.00 g 

KC1 0.20 g 

20 K 2 HP0 4 1.04 g 

KH 2 P0 4 0.32 g 

Washing buffer PBS, 0.05% (v/v) Tween 20 
Blocking buffer PBS, 2% (wt/v) Skim Milk powder 

Dilution buffer PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk 
25 powder 

Citrate buffer (0.1M, pH 5.0-5.2 (1 liter) )NaCitrate 20.60 g 

Citric acid 6.30 g 

Activation of CovaLink plates: 

• Make a fresh stock solution of 10 mg cyanuric chloride per ml 
30 acetone. 

• Just before use, dilute the cyanuric chloride stock solution 
into PBS, while stirring, to a final concentration of lmg/ml. 

• Add 100 ml of the dilution to each well of the CovaLink NH2 
plates, and incubate for 5 minutes at room temperature. 

35 • Wash 3 times with PBS. 

• Dry the freshly prepared activated plates at 50 °C for 30 
minutes . 

• Immediately seal each plate with sealing tape. 
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• Preactivated plates can be stored at room temperature for 3 
weeks when kept in a plastic bag. 

Sodium Borate, borax (Sigma) 
5 3,3-Dimethyl glutaric acid (Sigma) 
CaCl 2 (Sigma) 

Tresyl chloride (2,2,2-triflouroethansulfonyl chloride) (Fluka) 
l-ethyl-3- (3-dimethylaminopropyl) carbodiiraide (EDC) (Fluka) 
N-Hydroxy succinimide (Fluka art. 56480)) 
10 Phosgene (Fluka art. 79380) 
Lactose (Merck 7656) 

PMSF (phenyl methyl sulfonyl flouride) from Sigma 
Succinyl-Alanine-Alanine-Proline-Phenylalanine-para-nitroanilide 
(Suc-AAPF-pNP) Sigma no. S-7388, Mw 624.6 g/mole. 

15 

Colouring substrate; 

OPD: o-phenylene-diamine, (Kementec cat no. 4260) 
Test Animals: 

20 Dunkin Hartley guinea pigs (from Charles River, DE) 

Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard, 
Ry, Denmark. 

Equipment: 
25 XCEL II (Novex) 

ELISA reader (UVmax, Molecular Devices) 
HPLC (Waters) 
PFLC (Pharmacia) 

Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW, 
30 SLT: Fotometer from SLT Lablnstruments 

Size-exclusion chromatograph (Spherogel TSK-G2000 SW) . 
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW) 
Amicon Cell 

35 Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.gr. restriction endonucleases, ligases 
etc., are obtained from New England Biolabs. Inc. 
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Methods 

ELISA procedure for determination of IgG^ positive guinea pigs 

ELISA microtiter plates are coated with rabbit anti-PD498 
5 1:8000 in carbonate buffer and incubated over night at 4°C. The 
next day the plates is blocked with 2% BSA for 1 hour and washes 3 
times with PBS Tween 20. 

1 iig/ml PD498 is added to the plates and incubated for 1 hour, 
then washed 3 times with PBS Tween 20. 
10 All guinea pig sera samples and controls are applied to the 

ELISA plates with 2 \il sera and 98 |il PBS, incubated for 1 hour 
and washed 3 times with PBS Tween 20. 

Then goat anti-guinea pig IgGx (1:4000 in PBS buffer (Nordic 
Immunology 44-682)) is applied to the plates, incubated for 1 hour 
15 and washed with PBS tween 20. 

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma 
A4187) is applied and incubated for 1 hour, washed 2 times in PBS 
Tween20 and 1 time with diethanol amine buffer. 

The marked alkaline phosphatase is developed using p- 
20 nitrophenyl phosphate for 30 minutes at 37°C or until appropriate 
colour has developed. 

The reaction is stopped using Stop medium (K2HP0 4 /HaH3 buffer 
comprising EDTA (pH 10)) and read at OD 405/650 using a ELISA 
reader . 

25 Double blinds are included on all ELISA plates. 

Positive and negative sera values are calculated as the 
average blind values added 2 times the standard deviation. This 
gives an accuracy of 95%. 

30 Determination of the molecule weight 

Electrophoretic separation of proteins was performed by standard 
methods using 4-20% gradient SDS poly acrylamide gels (Novex) . 
Proteins were detected by silver staining. The molecule weight was 
measured relative to the mobility of Mark-12® wide range molecule 

3 5 weight standards from Novex. 



Protease activity 
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Analysis with Suc-Ala-Ala-Pro-Phe-pNa: 

Proteases cleave the bond between the peptide and p- 
nitroaniline to give a visible yellow colour absorbing at 405 nm. 

Buffer: e.g. Britton and Robinson buffer pH 8.3 
5 Substrate: 100 mg suc-AAPF-pNa is dissolved into 1 ml dimethyl 
sulfoxide (DMSO) . 100 \il of this is diluted into 10 ml with 
Britton and Robinson buffer. 

The substrate and protease solution is mixed and the 
absorbance is monitored at 405 nm as a function of time and ABS405 
10 nm/min. The temperature should be controlled (20-50°C depending on 
protease) . This is a measure of the protease activity in the 
sample. 

Proteolytic Activity 

15 In the context of this invention proteolytic activity is 

expressed in Kilo NOVO Protease Units (KNPU) . The activity is 
determined relatively to an enzyme standard (SAVINASE_) , and 
the determination is based on the digestion of a dimethyl 
casein (DMC) solution by the proteolytic enzyme at standard 

20 conditions, i.e. 50°C, pH 8.3, 9 min. reaction time, 3 min. 
measuring time. A folder AF 220/1 is available upon request to 
Novo Nordisk A/S, Denmark, which folder is hereby included by 
reference. 

A GU is a Glycine Unit, defined as the proteolytic enzyme 
25 activity which, under standard conditions, during a 15-minutes 1 
incubation at 40°C, with N-acetyl casein as substrate, produces 
an amount of NH2-group equivalent to 1 mmole of glycine. 

Enzyme activity can also be measured using the PNA assay, 
according to reaction with the soluble substrate succinyl- 
30 alanine-alanine-proline-phenyl-alanine-para-nitrophenol, which 
is described in the Journal of American Oil Chemists Society, 
Rothgeb , T.M. , Goodlander , B . D . , Garrison , P . H . , and Smith , 
L.A. , (1988). 

35 Fermentation of PD498 variants 

Fermentation of PD498 variants in J5. subtilis are performed 
at 30°C on a rotary shaking table (300 r.p.m.) in 500 ml baffled 
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In 
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order to make an e.g. 2 liter broth 20 Erlenmeyer flasks are 
fermented simultaneously. 

Media: 

5 BPX: Composition (per liter) 



Sodium caseinate lOg 

The starch in the medium is liquefied with a-amylase and 
the medium is sterilized by heating at 120°C for 45 minutes. 
After sterilization the pH of the medium is adjusted to 9 by 
15 addition of NaHC0 3 to 0.1 M. 

Purification of PD498 variants 

Approximately 1.6 litres of PD498 variant fermentation 
broth are centrifuged at 5000 rpm for 35 minutes in 1 litre 

20 beakers. The supernatants are adjusted to pH 7.0 using 10% 
acetic acid and filtered on Seitz Supra S100 filter plates. 
The filtrates are concentrated to approximately 400 ml using an 
Amicon CH2A UF unit equipped with an Amicon S1Y10 UF cartridge. 
The UF concentrate is centrifuged and filtered prior to 

25 absorption at room temperature on a Bacitracin affinity column 
at pH 7. The PD498 variant is eluted from the Bacitracin column 
at room temperature using 25% 2-propanol and 1 M sodium 
chloride in a buffer solution with 0.01 dime-thyl-glutaric 
acid, 0.1 M boric acid and 0.002 M calcium chloride adjusted to 

30 pH 7. 

The fractions with protease activity from the Bacitracin 
purification step are combined and applied to a 750 ml Sephadex 
G25 column (5 cm diameter) equilibrated with a buffer 
containing 0.01 dimethylglutaric acid, 0.1 M boric acid and 
35 0.002 M calcium chloride adjusted to pH 6.0. 

Fractions with proteolytic activity from the Sephadex G25 
column are combined and applied to a 150 ml CM Sepharose CL 6B 
cat-ion exchange column (5 cm diameter) equilibrated with a 



Potato starch 



lOOg 



10 



Ground barley 
Soybean flour 
Na 2 HP0 4 X 12 H 2 0 
Pluronic 



50g 
20g 
9g 

O.lg 
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buffer containing 0.01 M dimethylglutaric acid, 0.1 M boric 
acid, and 0.002 M calcium chloride adjusted to pH 6.0, 
The protease is eluted using a linear gradient of 0-0.5 M 
sodium chloride in 1 litres of the same buffer. 
5 Protease containing fractions from the CM Sepharose column are 
combined and filtered through a 2\i filter. 

^alb/C mice IaG ELISA Procedure; 

• The antigen is diluted to 1 mg/ml in carbonate buffer. 
10 • 100 ml is added to each well. 

• The plates are coated overnight at 4°C. 

• Unspecific adsorption is blocked by incubating each well for 1 
hour at room temperature with 200 ml blocking buffer. 

• The plates are washed 3x with 300 ml washing buffer. 

15 • Unknown mouse sera are diluted in dilution buffer, typically 
lOx, 2 Ox and 4 Ox, or higher. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with washing buffer. 
20 ■ The anti-Mouse IgGl antibody is diluted 2000x in dilution 

buffer. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with washing buffer. 
25 • Streptavidine is diluted 1000X in dilution buffer. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with 300 ml washing 
buffer. 

30 • OPD (0.6 mg/ml) and H 2 0 2 (0.4 ml/ml) is dissolved in citrate 
buffer. 

• 100 ml is added to each well. 

• Incubation is for 10 minutes at room temperature. 

• The reaction is stopped by adding 100 ml H 2 S0 4 . 

35 • The plates are read at 492 nm with 620 nm as reference. 

Immunisation of mice 

Balb/C mice (20 grams) are immunised 10 times (intervals of 14 
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days) by subcutaneous injection of the modified or unmodified 
polypeptide in question, respectively by standard proceedures 
known in art. 



5 EXAMPLES 
Example 1 

Suitable substitut ions in PD498 for addition of 3mjno 
10 attachment groups 

The 3D structure of parent PD498 was modeled as described 
above based on 59% sequence identity with Thermitase® 
(2tec.pdb) . 

The sequence of PD498 is (see SEQ ID NO, 2). PD498 residue 
15 numbering is used, 1-280. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2 .bcl below: 



Conservative substitutions: 

20 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 55,0,255 

3 Zone Subset LYS :lys:NZ Static monomer/residue 10 
Color_Subset 255,255,0 

25 4 Zone Subset NTERM :1:N Static monomer /residue 10 
Color_Subset 255,255,0 

5 #N0TE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 39, 72, 226 Static monomer/residue 8 
30 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 

10 Combine Subset REST Difference PD498FINALMODEL ALLZONE 
35 11 List Subset REST Atom Output File restatom. list 

12 List Subset REST monomer /residue Output_File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output File actsiteatom. list 

15 List Subset ACTSITE monomer/ residue OutputJFile 
40 actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer /Residue 5 - 
Color Subset 

18 Combine Subset SUB5A Difference REST5A ACTSITE 
45 19 Combine Subset SUB5B Difference SUB 5 A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

21 List Subset SUB5B Atom Output File sub5batom. list 

22 List Subset SUB5B monomer /residue Output_File subSbmole. list 
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23 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list, 
sub5 bat om. list & accsiteatom. list 

5 

Comments : 

Lines 1-8: The subset ALLZONE is defined as those residues 
which are either within 10 A of the free amino groups on 
lysines or the N-terminal, or within 8 A of the catalytic triad 
10 residues 39, 72 and 226. 

Line 10: The subset REST is defined as those residues not 
included in ALLZONE. 

Lines 17-20: Subset SUB5B is defined as those residues in a 
5 A shell around REST, excluding residues within 8 A of the 
15 catalytic residues. 

Line 23-24: REST contains Arg62 and Argl69, SUB5B contains 
Arg51 f Argl21, and Arg250. ACTSITE contains Argl03, but 
position 103 is within 8 A from essential_catalytic_residues, 
and thus not relevant. 
20 The colour codes are: (255,0,255) = magenta, 

(255,255,0)yellow, (255,0,0) red, and (255, 255, 255)= white. 

The substitutions R51K, R62K, R121K, R169K and R250K are 
identified in parent PD498 as suitable sites for mutagenesis. 
The residues are substituted below in section 2, and further 
25 analysis done: 



Non-conservative substitutions: 
makeKzone2 .bcl 

I #sourcefile make zone 2. bcl Claus von der Osten 961128 
30 2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace PD4 9 8 FINALMODEL 
35 newmodel 

6 Biopolymer 

7 #N0TE: editnextline object name according to protein 

8 Blank Object On PD4 9 8 FINALMODEL 

9 #NOTE : editnextlines with lys->arg positions 
40 10 Replace Residue newmodel: 51 lys L 

II Replace Residue newmodel: 62 lys L 

12 Replace Residue newmodel: 121 lys L 

13 Replace Residue newmodel: 169 lys L 

14 Replace Residue newmodel: 250 lys L 
45 15 # 
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16 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

17 Color Molecule Atoms newmodel Specified Specification 
255, 0,255 

5 18 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

19 Zone Subset NTERMx newmodel: 1:N Static monomer/residue 10 
Color_Subset 255,255,0 

20 #NOTE: editnextline ACTSITEx residues according to the 
10 protein 

21 Zone Subset ACTSITEx newmodel: 39, 72, 22 6 Static 
monomer/ residue 8 Color_Subset 255,255,0 

22 Combine Subset ALLZONEx Union LYSx NTERMx 

23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 24 Combine Subset RESTx Difference newmodel ALLZONEx 

25 List Subset RESTx Atom Output_File restxatom. list 
2 6 List Subset RESTx monomer/residue Output_File 
restxmole. list 
27 # 

20 28 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

29 List Subset ACTSITEx Atom Output File actsitexatom. list 

30 List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole. list 

25 31 # 

32 #read restxatom. list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

Comments : 

30 Lines 1-15: Solvent exposed arginines in subsets REST and 

SUB5B are replaced by lysines. Solvent accessibilities are 

recalculated following arginine replacement. 

Lines 16-23: The subset ALLZONEx is defined as those 

residues which are either within 10 A of the free amino groups 
35 on Lysines (after replacement) or the N-terminal, or within 8 A 

of the catalytic triad residues 39 , 72 and 226. 

Line 24-26: The subset RESTx is defined as those residues 

not included in ALLZONEx, i.e. residues which are still 

potential epitope contributors. Of the residues in RESTx, the 
40 following are >5% exposed (see lists below): 6-7,9-12,43- 

45,65,87-88,209,211,216-221,262. 

The following mutations are proposed in parent PD498: P6K, 

Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K, 

N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 
45 Relevant data for Example 1: 

Solvent accessibility data for PD498MODEL: 

# PD498MODEL Fri Nov 29 10:24:48 MET 1996 

# residue area 
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TRP 


1 


136.275711 




SER 
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88.188095 




pro" 


'3 


15.458788 




ASN 
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95.322319 
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ASP" 


5 
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PRO 
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tyr" 
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SER 
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10 
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"10 
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15 
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15 
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41.031750 


20 


thr" 
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4.321402 
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16.658991 
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42.107288 




ALA" 


23 


0.000000 




TRP" 
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3.713619 


25 


ASP" 


"25 


82.645493 




VAL 26 


74.397812 




THR 27 


14.950654 
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28 


110.606209 




GLY" 


"29 
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SER" 


"30 


57.225292 
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1.928865 




GLN" 
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"34 
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ALA" 
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val" 
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"38 


1.550332 




asp" 


"39 


3.585718 


40 


ser" 


"40 


2,475746 
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"41 


4.329043 




VAL" 


"42 


1.704864 




asp"* 


"43 


25.889742 




tyr" 


"44 


89.194855 


45 


asn" 


"45 


109.981819 
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"46 


0.268693 
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"47 


66.580925 
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49.618046 
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53 
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98.478104 


55 


LYS" 
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103.612228 




GLY" 
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17.199390 




tyr" 
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58 


0.000000 
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59 


40.291119 
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60 


50.151962 
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"61 


70.078888 


5 
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166.777557 
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120.641953 
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58.504269 
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68 


28.668840 




LEtT 


69 
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'70 


78.460953 




GLY 


71 


5.615932 


15 


HIS 


72 


43.158905 




GLY" 


73 


0.268693 
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0.000000 




his" 


"75 


0.484127 
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76 


1.880854 
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ALA 


77 


0.000000 




GLY" 


78 


0.933982 




THR 
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9.589676 




VAL~ 
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0.000000 




ALA 


81 


0.000000 


25 


ALA* 


82 


0.000000 




ASP" 


~83 


46.244987 




THR" 


"84 


27.783333 




asn" 


"85 


75.924225 




ASN™ 


"86 


44.813908 


30 


GLY" 


"87 


50.453152 




ile" 


"88 


74.428070 




gly" 


"89 


4.115077 




val" 


"90 


6.717335 




ALA*" 


"91 


2.872341 


35 


gly" 


"92 


0.233495 




met" 


"93 


5.876057 






"94 


0.000000 




pro" 


[95 


17.682203 




asp" 


[96 


83 . 431740 


40 


THR~ 


[97 


1.506567 




lys" 


"98 


72.674973 




ile" 


[99 


4.251006 




leu" 


"100 


6.717335 




ALA 


"101 


0.806080 


45 


val" 


"102 


1.426676 




arg" 


"103 


2.662697 




val" 


"104 


2.171855 




leu" 


"105 


18.808538 




asp" 


"106 


52. 167435 


50 


ALA" 


"107 


52.905663 




asn" 


"108 


115.871315 




gly" 


"109 


30.943356 




ser 110 


57.933651 




GLY 111 


50.705326 


55 


SER 


112 


56.383320 




leu" 


"113 


71.312195 




asp" 


"114 


110.410919 
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SER 115 


13.910152 




ILE 116 


22.570246 




ALA 
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5.642561 




SER 


118 


29.313131 


5 


GLY 119 


0.000000 




ILE 120 


1.343467 




ARG 121 


118.391129 




TYR 
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44.203033 




ALA 


123 


0.000000 


10 


ALA 
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7.974043 




ASP 
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83.851639 




GLN 
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64.311974 
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15 


LYS 
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1.039576 




LEU 


131 


2.149547 




ASN 
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LEU 


133 


1.880854 
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SER- 
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LEU 
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137 
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25 
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ASN 
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SER 


141 


25.899158 




THR 


142 
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THR~ 
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30 


LEU~ 
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LYS_ 
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SER- 


146 
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ALA_ 


147 


9.235920 




VAL_ 


148 


1.612160 


35 


ASP_ 


149 


57.431465 




TYR_ 


150 


106.352493 




ALA~ 
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0.268693 
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ASN 
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40 


LYS 
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45 


VAL 


159 


0.000000 




ALA 


160 


0.537387 




ALA 


161 


10.872165 




ALA 


162 


7.823834 




GLY- 


163 


12. 064573 


DU 


ASN" 


164 


Ol 1 OOIQO 
ol . lOJJOO 




ASP 


165 


64.495300 




ASN 


166 


83.457443 




VAL 167 


68.516815 




SER 168 


78.799652 
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170 


57.275074 




PHE 


171 


51.416462 



WO 98/35026 



PCT/DK98/00046 



47 





GLN_ 


172 


18 . 934589 




PRO_ 


173 


1. 880854 




ALA__ 


174 


6. 522357 




SER_ 


175 


26.184139 


5 


TYR 


176 


21.425076 




PRO 


177 


85. 613541 




ASN_ 


178 


34.700817 




ALA" 


179 


0.268693 




ILE 


180 


1. 074774 


10 


ALA 


181 


3 .761708 




VAL_ 


182 


0. 000000 




GLY_ 


183 


2 . 149547 




ALA_ 
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"193 


96.223808 




PHE 


"194 


51.482613 




SER_ 


[195 


1.400973 


25 


ASN_ 


[196 


4 . 148808 




TYR_ 


[l97 


80. 937309 




GLY~ 


198 


10.747736 




THR_ 
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205 


0 . 000000 


35 


PRO" 


"206 


0. 000000 
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[208 
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40 
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214 
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"226 


17.432346 




MET 


"227 


7.233279 




ALA" 


"228 


0.000000 



- WO 98/35026 



PC1YDK98/00046 



48 



SER_229 
PRO_230 
HIS_231 
VAL_232 
5 ALA_233 
GLY_234 
LEU_235 
ALAJ236 
ALA_237 

10 LEU_238 
LEU_239 
ALA_240 
SER_241 
GLN_242 

15 GLYJ243 
LYS_244 
ASN_245 
ASN_246 
VAL_247 

20 GLN_248 
ILE_249 
ARG_250 
GLNJ251 
ALA_252 

25 ILE_253 
GLU_254 
GLN_255 
THR_256 
ALA_257 

30 ASP_258 
LYS_259 
ILE_260 
SER_261 
GLY_262 

35 THR_263 
GLY_264 
THR_265 
ASN_266 
PHE_267 

40 LYS_268 
TYR_269 
GLY_270 
LYS_271 
ILE_272 

45 ASN_273 
SER_274 
ASN_275 
LYS_276 
ALA_277 

50 VAL_278 
ARG_279 
TYR_280 
CA_281 
CA_282 

55 CA 283 



0.000000 
0.268693 
2.680759 
0.000000 
0.000000 

I. 074774 

II. 500556 
0.000000 
0.000000 

I. 612160 
0.000000 
10.648088 
39.138004 
71.056175 
66.487144 
43.256012 
80.728127 
34.859673 
84.145645 
51.819775 
8.598188 
35.055809 
71.928093 
0.000000 
4.845899 
13.344438 
81.705254 
9.836061 
2.810513 
44.656136 
113.071686 
32.089527 
91.590103 
26.450439 
38.308762 
46.870056 
88.551804 
34.698349 
7.756911 
103.212852 
37.638382 
0.000000 

II. 376978 
2.885231 
19.195255 
2.651736 
38.177547 
84.549576 
1.074774 
4.775503 
162.693054 
96.572929 
0.000000 
0.000000 
8.803203 



Subset REST: 
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restmole. list 
Subset REST* 

PD498FINALMODEL: 6-7 , 9-12 , 43-46 , 61-63 , 65 , 87- 
89 , 111-114, 117-118 ,131, 
5 PD4 9 8 FINALMODEL : 137-139 , 158-159 , 169-171 , 173- 
174,180-181,209,211, 

PD4 9 8 FINALMODEL: 216-221, 232-233, 262, E282H 

restatom.list 
Subset REST: 
10 PD4 9 8 FINALMODEL: PRO 6:N,CA,CD,C,0,CB,CG 

PD4 9 8 FINALMODEL : TYR 7 : N , OA, C, O , CB , CG , GDI , CD2 , CE1 , CE2 , CZ , 
PD498FINALM0DEL: SER 9:N,CA,C,0,CB,0G 
PD4 9 8 FINALMODEL: ALA 10:N,CA,C,O,CB 

PD 4 9 8 FINALMODEL : TYR 11:N,CA,C,0,CB, CG, CD1,CD2 ,CE1,CE2 , CZ 
15 PD4 9 8 FINALMODEL : GLN 12 :N, CA, C, 0, CB, CG, CD, 0E1 ,NE2 

PD4 9 8 FINALMODEL : ASP 4 3 : N , CA , C , O , CB , CG , 0D1 , 0D2 
PD4 9 8 FINALMODEL : TYR 

44:N,CA,C,0,CB,CG,CD1,CD2,CE1 / CE2,CZ,0H 
PD49 8 FINALMODEL : ASN 4 5 : N , CA , C , O , CB , CG , 0D1 , ND2 
2 0 PD4 9 8 FINALMODEL : HIS 

46:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8FINALM0DEL : ASP 61 : N, CA, C, 0, CB, CG, 0D1 , 0D2 
PD4 9 8 FINALMODEL : ARG 
62:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
2 5 PD4 9 8FINALM0DEL : ASP 6 3 : N , CA , C , O , CB , CG , 0D1 , 0D2 

PD4 9 8 FINALMODEL : ASN 65:N,CA,C,0,CB,CG,0D1,ND2 
PD4 9 8 FINALMODEL :GLY 87:N,CA,C,0 

PD4 9 8FINALM0DEL : ILE 8 8 : N , CA , C , O , CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL : GLY 89:N,CA,C,0 
30 PD4 9 8 FINALMODEL: GLY 111:N,CA,C,0 

PD4 9 8 FINALMODEL : SER 1 12 : N , CA , C , O , CB , OG 

PD498 FINALMODEL : LEU 1 1 3 : N , CA , C , O , CB , CG , CD1 , CD 2 

PD4 9 8FINALM0DEL : ASP 1 1 4 : N , CA , C , O , CB , CG , 0D1 , OD2 

PD 4 9 8 FINALMODEL : ALA 117 :N, CA, C,0, CB 
35 PD4 9 8 FINALMODEL: SER 118:N,CA,C,0,CB,0G 

PD4 9 8 FINALMODEL: LEU 131:N,CA,C,0,CB,CG,CD1,CD2 

PD4 9 8 FINALMODEL : CYS 1 3 7 : N , CA , C , O , CB , SG 

PD4 9 8 FINALMODEL : GLU 
138:N,CA,C,0,CB,CG,CD,OEl,OE2 
40 PD498FINALMODEL: CYS 139 :N, CA, C,0, CB, SG 

PD4 9 8 FINALMODEL : VAL 1 5 8 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : VAL 15 9 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 F I N ALMODEL : ARG 
169:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
45 PD4 9 8 FINALMODEL :THR 170:N,CA, C,0,CB,0G1,CG2 

PD49 8 FINALMODEL : PHE 
171:N,CA,C,0,CB,CG,CD1,CD2, CE1, CE2 , CZ 

PD49 8 FINALMODEL : PRO 1 7 3 : N , CA , CD , C , O , CB , CG 

PD4 9 8 FINALMODEL : ALA 17 4 : N , CA , C , O , CB 
50 PD498FINALMODEL: ILE 180 : N, CA, C,0, CB, CGI , CG2 , CD1 

PD4 9 8 FINALMODEL : ALA 181:N,CA,C,0,CB 

PD498 FINALMODEL : ASN 2 0 9 : N , CA , C , 0 , CB , CG , 0D1 , ND2 

PD4 9 8 FINALMODEL : ALA 211:N,CA,C,0,CB 

PD4 9 8 FINALMODEL: ASN 216 :N, CA, C, O, CB, CG, 0D1 , ND2 
55 PD4 9 8 FINALMODEL : ASN 2 17 : N , CA , C , 0 , CB , CG , 0D1 , ND2 

PD4 9 8 FINALMODEL: GLY 218:N,CA,C,0 



WO 98/35026 



50 



PCT/DK98/00046 



PD4 98 FINALMODEL : TYR 

2 19 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
PD4 9 8 FINALMODEL: SER 220 :N f CA,C,0, CB f OG 
PD4 9 8 FINALMODEL : TYR 
5 221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD 4 9 8 F I N ALMODEL : VAL 232:N,CA ,0,0,08,001,062 
PD4 9 8 FINALMODEL : ALA 233 :N, CA, 0,0, CB 
PD4 9 8 FINALMODEL : GLY 262:N,CA,C,0 
PD4 9 8 FINALMODEL : CA E282H:CA 

10 

Subset SUB5B: 

sub5bmole. list 
Subset SUB5B: 

PD498FINALMODEL: 4-5, 8, 13-16, 34-35, 47- 
15 51,53,64,83,85-86,90-91,120-124, 

PD4 9 8 FINALMODEL: 128-130 , 140-14 1 , 143-144 , 147- 
148 , 151-152 , 156-157 , 

PD4 9 8 FINALMODEL: 165, 167-168 , 172 , 175-176 , 178- 
179, 196,200-205,208, 
20 PD498FINALMODEL: 234-237, 250, 253-254, 260-2 61, 263- 

267,272, E281H, 

PD498 FINALMODEL : E2 8 3H 

sub5batom. list 

25 Subset SUB5B: 

PD4 9 8 FINALMODEL : ASN 4 : N , CA , C , O , CB , CG , 0D1 , ND2 
PD498 FINALMODEL : ASP 5 : N , CA , C , O , CB , CG , OD 1 , 0D2 
PD4 9 8 FINALMODEL : TYR 
8:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

3 0 PD4 9 8 FINALMODEL : TYR 

13:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL: GLY 14:N,CA,C,0 
PD4 9 8 FINALMODEL: PRO 15 : N , CA, CD , C , O , CB , CG 
PD4 98 FINALMODEL : GLN 1 6 : N , CA , C , O , CB , CG , CD , 0E1 , NE2 

35 PD4 9 8 FINALMODEL :THR 34 :N, CA, C, O, CB, 0G1 , CG2 

PD4 9 8 FINALMODEL : VAL 3 5 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : PRO 4 7 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL : ASP 4 8 : N , CA , C , O , CB , CG , OD 1 , 0D2 
PD4 9 8 FINALMODEL : LEU 4 9 : N , CA , C , O , CB , CG , CD 1 , CD2 

40 PD 4 9 8 F IN ALMODEL : ALA 50:N,CA,C,O,CB 

PD498 FINALMODEL : ARG 

51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : VAL 5 3 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL : ASN 6 4 : N , CA , C , O , CB , CG , OD1 , ND2 

45 PD498FINALMODEL: ASP 83 :N,CA,C,0, CB, CG, OD1 , OD2 

PD4 9 8 FINALMODEL : ASN 85 : N , CA , C , O , CB , CG , 0D1 , ND2 
PD4 9 8 FINALMODEL : ASN 86:N,CA,C,0,CB,CG,0D1,ND2 
PD4 9 8 FINALMODEL : VAL 9 0 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : ALA 91:N, CA, C,0, CB 

50 PD4 9 8 FINALMODEL : ILE 120 : N, CA, C , O , CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL : ARG 

121:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD49 8 FINALMODEL : TYR 
122:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

55 PD4 9 8 FINALMODEL: ALA 123 : N , CA, C,0 , CB 

PD49 8 FINALMODEL : ALA 12 4 : N , CA , C , O , CB 
PD498 FINALMODEL : ALA 12 8 : N , CA , C , O , CB 
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PD498FINALMODEL: LYS 129 :N, CA, C, O, CB, CG, CD, CE, NZ 
PD4 9 8 FINALMODEL : VAL 1 3 0 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : ASN 14 0 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : SER 1 4 1 : N , CA , C , O , CB , OG 
5 PD4 9 8 F INALMODEL : THR 14 3 : N , CA , C , O , CB , OG1 , CG2 

PD 498 F IN ALMODEL : LEU 14 4 : N , CA , C , O , CB , CG , CD1 , CD2 
PD4 9 8 FINALMODEL: ALA 147 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : VAL 1 4 8 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 F INALMODEL : ALA 151 :N, CA, C, O, CB 

10 PD498F INALMODEL : TRP 

52:N,CA,C / 0,CB,CG,CD1,CD2,NE1,CE2,CE3, 
CZ2,CZ3,CH2 
PD4 9 8 FINALMODEL : ALA 15 6 : N , CA , C , O , CB 
PD 4 9 8 FINALMODEL : VAL 157 : N , CA , C , O , CB , CGI , CG2 

15 PD4 9 8 FINALMODEL : ASP 165 :N,CA,C,0, CB, CG,ODl,OD2 

PD4 9 8 FINALMODEL : VAL 1 67 : N , CA , C , O , CB , CGI , CG2 
PD 4 9 8 FINALMODEL : SER 168 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL : GLN 

172:N,CA,C,0,CB,CG, CD,OEl,NE2 

20 PD4 9 8 FINALMODEL: SER 175:N f CA,C,0, CB,OG 

PD498 FINALMODEL : TYR 

17 6:N,CA,C,0,CB,CG,CD1,CD2,CE1 / CE2, CZ,OH 
PD498 FINALMODEL : ASN 1 7 8 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : ALA 179 : N, CA, C, 0, CB 

25 PD4 9 8 FINALMODEL: ASN 196 : N, CA, C, O, CB, CG, OD1 , ND2 

PD4 9 8 F I N ALMOD EL : TRP 

200:N,CA,C / O,CB,CG,CDl,CD2,NEl,CE2,CE3, 
CZ2,CZ3,CH2 
PD4 9 8 FINALMODEL : VAL 201 :N, CA, C,0, CB, CGI , CG2 

30 PD 4 9 8 FINALMODEL : ASP 202 :N,CA,C,0,CB,CG,ODl,OD2 

PD4 9 8 FINALMODEL : VAL 2 03 : N , CA , C , 0 , CB , CGI , CG2 
PD4 9 8 FINALMODEL : THR 2 0 4 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL : ALA 205 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : VAL 2 0 8 : N , CA , C , 0 , CB , CGI , CG2 

35 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL: LEU 2 3 5 : N , CA , C , O , CB , CG , CD1 , CD2 
PD4 9 8 FINALMODEL: ALA 236:N / CA r C,0,CB 
PD4 9 8 FINALMODEL : ALA 2 3 7 : N , CA , C , O , CB 
PD498 FINALMODEL : ARG 

40 250:N,CA,C,O,CB,CG,CD,NE,CZ,NHl,NH2 

PD4 9 8 FINALMODEL : ILE 2 5 3 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL : GLU 

254:N,CA,C,0,CB,CG,CD,0E1,0E2 
PD4 9 8 FINALMODEL : ILE 2 6 0 : N , CA , C , O , CB , CGI , CG2 , CD1 

45 PD4 9 8 FINALMODEL: SER 261:N,CA,C,0,CB,OG 

PD4 9 8 FINALMODEL: THR 263 : N , CA, C , 0 , CB , OG1 , CG2 
PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 265 : N , CA , C , 0 , CB , OG1 , CG2 
PD4 9 8 FINALMODEL : ASN 2 6 6 : N , CA , C , O , CB , CG , OD1 , ND2 

50 PD4 9 8 FINALMODEL :PHE 

267:N I CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
PD4 9 8 FINALMODEL : ILE 2 7 2 : N , CA , C , 0 , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL :CA E281H:CA 
PD498FINALMODEL: CA E283H:NA 

55 

Subset ACTSITE: 

actsitemole. list 
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Svifc)S6t ACTSITE : 

PD498FINALMODEL: 36-42 , 57-60 , 66-80 , 100-110 , 115- 
116,119,132-136,160-164, 

PD4 9 8 FINALMODEL : 182-184 , 194 , 206-207 , 210 , 212- 
5 215,222-231 

actsiteatom. list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL: ALA 36:N,CA,C,0, CB 

10 PD4 9 8 FINALMODEL :VAL 37 :N, CA, C # 0, CB, CGI, CG2 

PD498FINALMODEL: LEU 3 8 : N , CA , C , O , CB , CG , CD1 , CD2 
PD4 9 8 FINALMODEL : ASP 3 9 : N , CA , C , O , CB , CG , OD 1 , OD2 
PD49 8 FINALMODEL : SER 4 0 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL :GLY 41:N,CA,C,0 

15 PD4 9 8 FINALMODEL : VAL 42:N,CA,C # 0,CB,CG1,CG2 

PD4 9 8FINALMODEL : TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL: ASP 58 : N , CA, C , O, CB, CG, OD1 , OD2 
PD498 FINALMODEL : PHE 

20 59 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 

PD4 9 8 FINALMODEL : ILE 60 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL: PRO 66 : N , CA, CD , C, O, CB , CG 
PD4 9 8 FINALMODEL : MET 67 :N,CA,C,0,CB,CG, SD, CE 
PD4 9 8 FINALMODEL : ASP 68 : N , CA , C , O , CB , CG , OD1 , OD2 

25 PD4 9 8 FINALMODEL: LEU 69:N,CA,C,0,CB,CG,CD1,CD2 

PD4 9 8 FINALMODEL : ASN 70:N,CA,C,O,CB,CG,ODl,ND2 
PD4 9 8 FINALMODEL : GLY 71:N,CA,C,0 
PD4 9 8 FINALMODEL : HI S 

72:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 PD498FINALMODEL: GLY 73:N,CA,C,0 

PD498 FINALMODEL : THR 7 4 : N , CA , C , O , CB , OG 1 , CG2 
PD498 FINALMODEL : HI S 

75:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL : VAL 7 6 : N , CA , C , O , CB , CGI , CG2 

35 PD4 9 8 FINALMODEL : ALA 77:N,CA,C,0,CB 

PD4 9 8 FINALMODEL: GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 79:N,CA,C,0,CB,0G1,CG2 
PD4 9 8 FINALMODEL : VAL 8 0 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL : LEU 1 0 0 : N , CA , C , 0 , CB , CG , CD1 , CD2 

40 PD4 9 8 FINALMODEL : ALA 101:N, CA, C, 0, CB 

PD4 9 8 FINALMODEL: VAL 102 :N, CA, C, 0, CB,CG1,CG2 
PD4 9 8 F INALMODEL : ARG 

103:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 F I N ALMOD EL : VAL 104 : N , CA , C , O , CB , CGI , CG2 

45 PD4 9 8FINALMODEL : LEU 105 :N, CA, C, O, CB, CG, CD1 , CD2 

PD4 9 8 FINALMODEL : ASP 1 0 6 : N , CA , C , 0 , CB , CG , OD1 , OD2 
PD 4 9 8 F IN ALMODEL : ALA 1 0 7 : N , C A , C , 0 , CB 
PD4 9 8 FINALMODEL : ASN 1 08 : N , CA , C , O , CB , CG , ODl , ND2 
PD4 9 8 FINALMODEL: GLY 109:N,CA,C,0 

50 PD498FINALMODEL: SER 110 :N, CA, C, 0, CB, OG 

PD4 9 8 FINALMODEL: SER 115 :N, CA, C, 0, CB, OG 
PD4 9 8 FINALMODEL: ILE 116 :N, CA, C, O, CB, CGI , CG2 , CD1 
PD 4 9 8 F I N ALMODEL : GLY 119:N,CA,C,0 
PD4 9 8 FINALMODEL : ASN 1 3 2 : N , CA , C , O , CB , CG , ODl , ND2 

55 PD4 9 8 FINALMODEL: LEU 133 :N, CA, C, 0, CB, CG, CD1 , CD2 

PD498FINALMODEL: SER 134 :N, CA, C, 0, CB, OG 
PD4 9 8 FINALMODEL : LEU 1 3 5 : N , CA , C , 0 , CB , CG , CD1 , CD 2 
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PD4 9 8FINALMODEL : GLY 136:N,CA,C,0 
PD4 9 8FINALMODEL : ALA 160 : N , CA , C , O , CB 
PD4 9 8FINALMODEL : ALA 161 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 162 :N, CA, C, O, CB 
5 PD4 9 8 FINALMODEL : GLY 163:N,CA,C,0 

PD4 9 8 FINALMODEL : ASN 164 : N , CA, C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : VAL 18 2 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL: GLY 183:N,CA,C,0 
PD4 9 8FINALMODEL : ALA 184 :N, CA, C,0, CB 
10 PD498 FINALMODEL : PHE 

1 9 4 : N , CA , C , O , CB , CG , CD 1 , CD2 , CE1 , CE2 , CZ 
PD4 9 8 FINALMODEL : PRO 2 06 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL ; GLY 207:N,CA,C,O 

PD4 9 8 FINALMODEL : ILE 2 10 : N , CA, C , O , CB , CGI , CG2 , CD1 
15 PD4 9 8 FINALMODEL :SER 212 :N, CA, C,0,CB,OG 

PD4 9 8 FINALMODEL :THR 213 :N,CA,C,0,CB,0G1,CG2 
PD498 FINALMODEL : VAL 2 14 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL : PRO 2 15 : N , CA, CD , C , O , CB , CG 
PD4 9 8 FINALMODEL : MET 2 2 2 : N , CA , C , O , CB , CG , SD , CE 
20 PD4 9 8 FINALMODEL :SER 223 :N, CA, C,0, CB, OG 

PD4 9 8 FINALMODEL: GLY 224:N,CA,C,0 
PD4 9 8FINALMODEL : THR 2 2 5 : N , CA , C , O , CB , OG1 , CG2 
PD498 FINALMODEL : SER 2 2 6 : N , CA , C , O , CB , OG 
PD49 8 FINALMODEL : MET 2 2 7 : N , CA , C , O , CB , CG , SD , CE 
25 PD4 9 8 FINALMODEL: ALA 228 :N, CA, C,O r CB 

PD4 9 8 FINALMODEL: SER 229 :N, CA, C, O, CB,OG 
PD4 9 8 FINALMODEL: PRO 230 :N, CA, CD, C , O , CB, CG 
PD4 9 8 FINALMODEL : HI S 
231:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 

Subset RESTx: 

restxmole . list 
Subset RESTX: 

NEWMODEL: 6-7 , 9-12 ,43-46 , 65 , 87- 
35 89,131,173,209,211,216-221,232-233, 
NEWMODEL: 262 , E282H 

restxatonw list 
Subset RESTX: 
40 NEWMODEL : PRO 6 : N, CA, CD, C, O, CB , CG 

NEWMODEL :TYR 
7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL : SER 9 :N,CA,C,0,CB,OG 
NEWMODEL : ALA 10:N,CA,C,O,CB 
45 NEWMODEL :TYR 

ll:N,CA,C,0,CB,CG,CDl,CD2,CEl,CE2,CZ,OH 

NEWMODEL : GLN 12 :N,CA,C,0,CB,CG,CD,0E1,NE2 
NEWMODEL : ASP 43 :N,CA,C,0,CB,CG,0D1,0D2 
NEWMODEL :TYR 
50 44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWMODEL: ASN 45 :N, CA,C,0,CB,CG,0D1,ND2 
NEWMODEL : HI S 46:N,CA,C,0, CB,CG,ND1,CD2 ,CE1,NE2 
NEWMODEL : ASN 6 5 : N , CA , C , O , CB , CG , OD 1 , ND2 
NEWMODEL : GLY 87:N,CA,C,0 
55 NEWMODEL : ILE 88:N,CA,C,0,CB,CG1,CG2 ,CD1 

NEWMODEL : GLY 89:N,CA,C,0 

NEWMODEL : LEU 1 3 1 : N , CA , C , 0 , CB , CG , CD1 , CD2 



WO 98/35026 



54 



PCTYDK98/00046 



NEWMODEL : PRO 173 :N, CA, CD, C, 0, CB, CG 
NEWMODEL : ASN 209 :N, CA,C,0,CB,CG,0D1,ND2 
NEWMODEL : ALA 211 :N, CA,C,0,CB 
NEWMODEL: ASN 216:N,CA,C,0,CB,CG,0D1,ND2 
5 NEWMODEL : ASN 217:N,CA,C,0,CB,CG,0D1,ND2 

NEWMODEL : GLY 218:N,CA,C,0 
NEWMODEL : T YR 
219:N,CA,C,0,CB,CG,CD1,CD2, CE1, CE2 , CZ , OH 
NEWMODEL : SER 220 :N, CA, C,0, CB,OG 

10 NEWMODEL :TYR 

221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL : VAL 232 :N,CA,C,0,CB,CG1,CG2 
NEWMODEL : ALA 233 :N, CA, C,0, CB 
NEWMODEL : GLY 262:N,CA,C,0 

15 NEWMODEL :CA E282H:CA 



Example 2 

Suitable substitutions in Savinase® for addition of amino 
2 0 attachment groups f-NH ol 

The known X-ray structure of Savinase® was used to find 
where suitable amino attachment groups may is added (Betzel et 
al, (1992), J. Mol. Biol. 223, p. 427-445). 

The 3D structure of Savinase® is available in the Brookhaven 
25 Databank as lsvn.pbd. A related subtilisin is available as 
1st 3 .pdb. 

The sequence of Savinase® is shown in SEQ ID NO. 3 
The sequence numbering used is that of subtilisin BPN 1 , 
Savinase® having deletions relative to BPN 1 at positions: 36, 
30 56, 158-159 and 163-164. The active site residues (functional 
site) are D32,H64 and S221. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2 .bcl below: 



35 Conservative substitutions: 
makeKzone.bcl 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 
40 255 255 0 

Zone Subset NTERM :el:N Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE : editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :e32 ,e64 ,e221 Static monomer /residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union LYS NTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#N0TE: editnextline object name according to the protein 
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Combine Subset REST Difference SAVI8 ALLZONE 
List Subset REST Atom Output File restatom. list 
List Subset REST monomer /residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 List Subset ACTSITE Atom Output File actsiteatom. list 
List Subset ACTSITE monomer /residue Output_File 
actsiteraole. list 
# 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
10 Combine Subset SUB 5 A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File subSbatom. list 
List Subset SUB5B monomer/residue Output_File sub5bmole. list 
15 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

#Use grep command to identify ARG in restatom. list , 
subSbatom. list & accsiteatom. list 

2 0 Comments : 

In this case of Savinase® REST contains the Arginines ArglO, 
Argl70 and Arg 186, and SUB5B contains Argl9, Arg45, Argl45 and 
Arg247. 

These residues are all solvent exposed. The substitutions 
25 R10K, R19K, R45K, R145K, R170K, R186K and R247K are identified 
in Savinase® as sites for mutagenesis within the scope of this 
invention. The residues are substituted below in section 2, 
and further analysis done. The subset ACTSITE contains Lys94. 

The substitution K94R is a mutation removing Lysine as 

3 0 attachment group close to the active site. 



Non-conservative substitutions: 
makeKzone2 . bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
35 # 

#having scanned lists (grep arg command) and identified sites 
for lys->arg substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace SAVI8 newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On SAVI8 

#NOTE: editnextlines with lys->arg positions 

Replace Residue newmodel :el0 lys L 
45 Replace Residue newmodel :el7 0 lys L 

Replace Residue newmodel :el8 6 lys L 

Replace Residue newmodel :el9 lys L 

Replace Residue newmodel :e45 lys L 

Replace Residue newmodel : el4 5 lys L 
50 Replace Residue newmodel :e241 lys L 
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#Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
5 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
ColorJSubset 255,255,0 

Zone Subset NTERMx newmodel: el :N Static monomer /residue 10 
ColorJSubset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel :e3 2, e64,e2 21 Static 

monomer /residue 8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union LYSx NTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

Combine Subset RESTx Difference newmodel ALLZONEx 
15 List Subset RESTx Atom Output File restxa torn. list 

List Subset RESTx monomer/ residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255, 0,0 
List Subset ACTSITEx Atom Output^File actsitexatom. list 
20 List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole. list 
# 

#read res txatom. list or restxmole. list to identify sites for 
(not_arg) ->lys subst. if needed 

25 

Comments : 

Of the residues in RESTx, the following are >5% exposed (see 

lists below): 5,14,22,38-40,42,75-76,82,86,103-105,108,133- 

135,137,140,173,204,206,211-213,215-216,269. The following 

30 mutations are proposed in Savinase®: P5K, P14K, T22K, T38K, 

H39K, P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K, 

A108K, A133K, T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, 

G211K, S212K, T213K, A215K, S216K, N269K. 

Relevant data for Example 2: 

35 Solvent accessibility data for SAVINASE®: 

# SAVI8NOH20 Fri Nov 29 13:32:07 MET 1996 





# residue 


area 




ALA 1 


118.362808 




GLN 2 


49.422764 


40 


SER 3 


61.982887 




VAL 4 


71.620255 




PRO 5 


21.737535 




TRP 6 


58.718731 




GLY 7 


4.328117 


45 


ILE 8 


6.664074 




SER 9 


60.175900 




ARG 10 


70.928963 




VAL 11 


2.686934 




GLN 12 


72.839996 


50 


ALA 13 


0.000000 




PRO 14 


52.308453 




ALA 15 


38.300892 




ALA 16 


0.000000 
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HISJL7 
ASN_18 
ARG_19 
GLY_20 
5 LEU_21 
THR_22 
GLY_23 
SER_24 
GLY_25 

10 VAL_26 
LYS_27 
VAL_28 
ALA_29 
VAL_30 

15 LEU_31 
ASP_32 
THRJ33 
GLY_34 
ILE_35 

20 SERJ3 6 
THR_37 
HIS_3 8 
PRO_39 
ASP_40 

25 LEU_41 
ASN_42 
ILE_43 
ARG_44 
GLY_45 

30 GLY_46 
ALA_47 
SER_48 
PHE_49 
VAL_50 

35 PRO_51 
GLY_52 
GLU_53 
PRO_54 
SER_55 

40 THR_56 
GLN_57 
ASP_58 
GLY_59 
ASN_60 

45 GLY_61 
HIS_62 
GLY_63 
THR_64 
HIS_65 

50 VAL_66 
ALA_67 
GLY_68 
THR_69 
ILE_70 

55 ALAJ71 
ALAJ72 
LEU 73 



41.826324 

136.376602 

105.678642 

48.231510 

17.196377 

36.781742 

0.000000 

64.151276 

50.269905 

4.030401 

54.239555 

0.000000 

0.000000 

3.572827 

0.233495 

1.074774 

1.973557 

3.638052 

8.044439 

8.514903 

122.598907 

18.834011 

76.570526 

0.000000 

19.684013 

88.870216 

56.117710 

110.647194 

26.935413 

35.515778 

21.495472 

34.876190 

52.647541 

23.364208 

110.408752 

80.282906 

43.033707 

124.444336 

60.284889 

47.103241 

120.803505 

12.784743 

61.742443 

56.760231 

1.576962 

38.590118 

0.000000 

0.537387 

0.968253 

1.612160 

0.000000 

2.801945 

9.074596 

0.000000 

4.577205 

0.000000 

47.290039 
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ASNJ74 
ASNJ75 
SER_76 
ILEJ77 
5 GLY_78 
VAL_79 
LEU_80 
GLY_81 
VAL_82 

10 ALA_83 
PRO_84 
SER_85 
ALA_86 
GLU_87 

15 LEU_88 
TYR_89 
ALA_90 
VAL_91 
LYS_92 

20 VAL_93 
LEU_94 
GLY_95 
ALA_96 
SER_97 

25 GLY_98 
SER_99 
GLY_100 
SER_101 
VAL_102 

30 SER_103 
SER_104 
ILE_105 
ALA_106 
GLNJL07 

35 GLYJL08 
LEU_109 
GLU_110 
TRP_111 
ALAJL12 

40 GLY_113 
ASN_114 
ASN_115 
GLY_116 
MET_117 

4 5 HIS_118 
VAL_119 
ALAJL20 
ASNJL21 
LEUJL22 

50 SERJL23 
LEU_124 
GLYJL25 
SER_12 6 
PRO_127 

55 SER_128 
PRO_129 
SER 130 



102.187248 

60.210400 

84,614494 

66.098572 

17.979534 

5.642561 

13.025185 

0.000000 

0.268693 

0.000000 

18.193810 

56.839039 

13.075745 

37.011765 

2.149547 

30.633518 

I. 343467 
0.779450 
5.862781 
0.466991 
10.747736 
8.707102 
41.414677 
96.066040 
33.374485 
67.664116 
35.571117 
54. 096992 
52. 695324 
62.929684 
8.683097 
15.852910 
14.509443 
94.463066 
0.000000 
0.537387 
63.227707 
55.500740 
0.502189 

II. 908267 
107.208527 
78.811234 
41.453194 
9.634291 
54.022118 
5.105174 
0.268693 
0.233495 
0.537387 
4.004620 
21.927265 
55.952454 
40.241180 
107.409439 
57.988609 
85.021118 
20.460915 
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ALA 


131 


57.404362 




THR 


132 


74.438805 




LEU 


133 


12.091203 




GLU 


134 


73.382019 


5 


GLN 


135 


114.870010 




ALA 


136 


2.122917 




VAL 


137 


1.074774 




ASN" 


138 


55.622704 




SER 


139 


29.174965 


10 


ALA" 


140 


0.268693 




THR 


141 


27.962946 




SER 


142 


87.263145 




ARG 


143 


88.201218 




GLY 


144 


38.477882 


15 


VAL 


145 


2.079151 




LEU 


146 


13 .703363 




VAL 


147 


2.690253 




VAL 


148 


1.074774 




ALA 


"149 


0.000000 


20 


ALA 


*150 


4.356600 




SER 


151 


0.000000 




GLY 


152 


12.628590 




ASN~ 


153 


84.248703 




SER~ 


"154 


77.662354 


25 


GLY 


"155 


25.409861 




ALA 


"156 


38.074570 




GLY 


"157 


40.493744 




SER 


"158 


53.915291 




ILE~ 


"159 


4.352278 


30 


ser" 


"160 


12.458543 




tyr" 


"161 


29.670284 




pro" 


"162 


4.030401 




ALA" 


"163 


0.968253 




arg~ 


*164 


84.059120 


35 


tyr" 


"165 


28.641129 




ALA 


"166 


68.193314 




ASN" 


"167 


61.686481 




ALA" 


"168 


0.537387 




MET" 


"169 


0.586837 


40 


ALA 


"170 


0.000000 




VAL 


"171 


0.000000 




GLY" 


'172 


0.000000 




ALA' 


"173 


0.933982 




THR" 


"174 


3.013133 


45 


ASP" 


"175 


34.551376 




GLN~ 


"176 


96.873039 




ASN" 


"177 


98.664368 




asn" 


"178 


41.197159 




asn" 


"179 


60.263512 


50 


arg" 


"180 


64.416336 




ALA" 


"181 


7.254722 




SER" 


"182 


91.590881 




phe" 


"183 


52.126518 




ser' 


"184 


2.101459 


55 


gln" 


"185 


15.736279 




TYR" 


"186 


44.287792 




GLY" 


"187 


5.114592 
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ALA 


188 


69.406563 




GLY 


189 


36.926083 




LEU 


190 


16.511177 




ASP~ 


191 


7.705349 


5 


ile" 


192 


0.268693 




VAL 


193 


4.299094 




ALA 


194 


0.000000 




PRO 


195 


0.806080 




GLY~ 


196 


0.000000 


10 


VAL 


197 


25.257177 




ASN 


198 


82.177422 




VAL" 


199 


10.747736 




GLN~ 


'200 


80.374527 




ser" 


"201 


2.008755 


15 


THR 


202 


0.000000 




TYR 


203 


80.679886 




PRO 


"204 


34.632195 




GLY 


205 


74.536827 




SER" 


"206 


74.964920 


20 


THR 


207 


57.070065 




TYR 


'208 


82-895500 




ALA 


209 


22.838940 




SER 


'210 


69.045639 




LEU 


"211 


49.708279 


25 


ASN 


212 


86.905457 




GLY 


'213 


2.686934 




THR~ 


214 


4.669909 




SER" 


215 


15.225292 




MET 


216 


7.261287 


30 


ALA* 


217 


0.000000 




THR 


"218 


0.000000 




PRO" 


219 


0.806080 




HIS 


"220 


2. 662697 




VAL 


*221 


0.268693 


35 


ALA" 


'222 


0.000000 




GLY* 


223 


0.000000 




ALA" 


"224 


7.206634 




ALA" 


'225 


1.039576 




ALA" 


"226 


0.268693 


40 


LEU" 


"227 


1.074774 




VAL" 


"228 


1.541764 




LYS" 


229 


39.262505 




GLN" 


"230 


54.501614 




LYS" 


"231 


81.154129 


45 


ASN 


'232 


30.004124 




PRO" 


"233 


91.917931 




SER" 


"234 


102.856705 




TRP" 


"235 


64.639481 




SER" 


"236 


51.797619 


50 


ASN" 


"237 


24.866917 




VAL" 


"238 


78.458466 




GLN" 


"239 


73.981461 




ile" 


"240 


14.474245 




arg" 


"241 


41.242931 


55 


asn~ 


"242 


64.644814 




his" 


"243 


50.671440 




LEU 


"244 


5.127482 
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LYS 


245 


48.820000 




ASN~ 


246 


115.264534 




thr" 


"247 


22.205376 




ALA 


'248 


16.415077 


5 


THR 


"249 


60.503101 




SER 


"250 


74.511597 




LEU"" 


"251 


48.861599 




GLY 252 


39.124340 




SER 


253 


49.811481 


10 


thr" 


'254 


88.421982 




ASN~ 


"255 


72.490181 




LEtf 


"256 


54.835758 




TYR~ 


"257 


38.798912 




gly*" 


"258 


3.620916 


15 


SER" 


"259 


35.017368 




GLY* 


"260 


0.537387 




LEU" 


"261 


8.598188 




VAL~ 


"262 


4.519700 




asn~ 


"263 


16.763659 


20 


ALA"* 


"2 64 


3.413124 




GLU" 


"2 65 


37.942276 




ALA" 


"266 


15.871746 




ALA"" 


"267 


3.947115 




THR" 


"268 


2.475746 


25 


arg" 


"269 


176.743362 




ion- 


"270 


0.000000 




ion"" 


"271 


5.197493 




Subset REST: 



restmole. list 



3 0 Subset REST * 

SAVI8:E5-E15,E17-E18,E22,E38-E40,E42-E43,E73-E76,E82-E86,E103- 
E105, 

SAVI8 : E108-E109 , E111-E112 , E115-E116 , E122 , E128-E144 , E149- 

Ei50,Ei56-Ei57, 

35 SAVI8 : E160-E162 , E165-E168 , E170-E171 , E173 , E180-E188 , E190- 
E192,E200 / 

SAVI8 : E203-E204 , E206 , E211-E213 , E215-E216 , E227-E230 , E255- 
E259,E261-E262 # 
SAVI8:E267-E269 
40 restatom. list 
Subset REST: 

SAVI8:PRO E5:N,CD,CA,CG,CB,C,0 

SAVI8:TRP E6 : N, CA, CD2 , CE2 , NE1, CD1, CG, CE3 , CZ3 , CH2 , CZ2 ,CB,C, O 
SAVI8:GLY E7:N,CA,C,0 
45 SAVI8:ILE E8 : N, CA, CD1, CGI , CB, CG2 , C, O 
SAVI8:SER E9:N,CA,OG,CB,C,0 

SAVI8:ARG E10 :N, CA,NH2 ,NH1 , CZ, NE, CD, CG, CB, C, 0 
SAVI8:VAL E11:N,CA,CG2,CG1,CB,C,0 
SAVI8:GLN E12:N,CA,NE2,0E1,CD,CG,CB,C,0 
50 SAVI8:ALA E13 :N,CA,CB,C,0 

SAVI8:PRO E14:N,CD,CA,CG,CB,C,0 
SAVI8:ALA E15:N,CA,CB,C,0 

SAVI8:HIS E17:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
SAVI8:ASN E18:N,CA,ND2,0D1,CG,CB,C,0 
55 SAVI8:THR E22 :N, CA,CG2 , OG1 , CB, C, 0 
SAVI8:THR E38 :N, CA, CG2 , OG1 , CB, C, O 
SAVI8:HIS E39:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
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SAVI8 : 


PRO 


E40:N,CD,CA,CG,CB,C,0 




SAVI8: 


LEU 


E42:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8: 


ASN 


E43:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


ALA 


E73:N,CA,CB,C,0 


5 


SAVI8: 


ALA 


E74:N,CA,CB,C,0 




SAVI8: 


LEU 


E75:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


ASN 


E76:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


LEU 


E82:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8: 


GLY 


E83:N,0A,C,O 


10 


SAVI8 : 


VAL 


E84:N,CA,CG2,CG1,CB,C,0 




SAVI8 : 


ALA 


E85:N,CA,0B,C,O 




SAVI8: 


PRO 


E86:N,CD,CA,CG,CB,C,0 




SAVI8: 


SER 


E103: 


N,CA, 00,08,0,0 




SAVI8 : 


VAL 


E104: 


N, OA, CG2 ,001,06,0,0 


15 


SAVI8 : 


SER 


E105: 


N , OA , OG , CB ,0,0 




SAVI8: 


ALA 


E108: 


N,CA,CB,C,0 




SAVI8: 


GLN 


E109: 


N, OA, NE2 , OEl , CD, CG, CB, 0,0 




SAVI8 s 


LEU 


Bill: 


N, OA, 002,001,00,06,0,0 




SAVI8: 


GLU 


E112: 


N,CA,OE2,OE1,CD,CG,CB,C,0 


20 


SAVI8 : 


GLY 


E115: 


N, OA, 0,0 




SAVI8 : 


ASN 


E116: 


N,CA,ND2,OD1,CG,CB,C,0 




SAVI8 : 


ALA 


E122: 


N,CA,CB,C,0 




SAVI8 : 


SER 


E128: 


N,CA,OG,CB,C,0 




SAVI8! 


PRO 


E129: 


N,CD,CA,CG,CB,C,0 


25 


SAVI8 : 


SER 


E130: 


N,CA,OG,CB,C,0 




SAVI8 : 


PRO 


E131: 


N , CD , OA , CG , CB , C , 0 




SAVI8: 


SER 


E132: 


N,CA,OG,CB,C,0 




SAVI8 : 


ALA 


E133: 


N,CA,C8,C,0 




SAVI8 : 


THR 


E134: 


N,CA,CG2,OG1,CB,C,0 


30 


SAVI8: 


LEU 


E135: 


N,CA,CD2 ,001,00,06,0,0 




SAVI8: 


GLU 


E136: 


N,CA,0E2 ,OE1,CD,CG,CB,C,0 




SAVI8: 


GLN 


E137: 


N,CA,NE2,OEl,CD,CG,C6,C,0 




SAVI8: 


ALA 


E138: 


N,CA,C8,C,0 




SAVI8 : 


.VAL 


E139: 


:N,CA,CG2,CG1,C8,C,0 


35 


SAVI8 ; 


:ASN 


E140: 


:N, CA,ND2 , ODl , CG, CB , C,0 




SAVI8 : 


:SER 


E141: 


:N,CA,0G,CB,C,0 




SAVI8 ' 


: ALA 


E142: 


:N,CA,CB,C,0 




SAVI8 


:THR 


E143' 


:N,CA,CG2,OG1,CB,C,0 




SAVI8 


:SER 


B144 


:N,CA,OG,CB,C,0 


40 


SAVI8 


:VAL 


E149 


:N,CA,CG2,CG1,CB,C,0. 




SAVI8 


:VAL 


E150 


:N,CA, 002,061,06,0,0 




SAVI8 


:SER 


E156 


:N,CA,OG,CB,C,0 




SAVI8 


: GLY 


E157 


:N,CA,C,0 




SAVI8 


: ALA 


E160 


:N,CA,CB,C,0 


45 


SAVI8 


:GLY 


E161 


:N,CA,C,0 




SAVI8 


SSER 


E162 


:N,CA,OG,CB,C,0 




SAVI8 


:ILE 


E165 


:N,CA,CD1,CG1,CB / CG2,C,0 




SAVI8 


:SER 


E166 


:N,CA,0G,CB,C,0 




SAVI8 


:TYR 


E167 


:N / CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 


50 


SAVI8 


:PRO 


E168 


:N,CD,CA,CG,CB,C,0 




SAVI8 


: ARG 


E170 


:N,CA,NH2,NH1,CZ,NE,CD,CG,C8,C,0 




SAVI8 


:TYR 


E171:N,CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 




SAVI8 


:ASN 


E173 :N,CA,ND2,ODl,CG,CB,C,0 




SAVI8 


:THR 


E180:N,CA,CG2 ,OG1,CB,C,0 


55 


SAVI8 


:ASP 


E181 


:N,CA,0D2,0D1,CG,C6,C,0 




SAVI8 


: GLN 


E182 :N,CA,NE2,OEl,CD,CG,CB,C,0 




SAVI8 


:ASN 


E183 :N,CA,ND2,ODl,CG,CB,C,0 
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SAVI8 : 


ASN 


E184: 


N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8 : 


ASN 


E185: 


>N, 


CA , ND2 , 0D1 , CG , CB , C , 0 






SAVI8 : 


ARG 


E186: 


;N, 


CA,NH2,NH1,CZ,NE, CD,CG,CB,C, 


0 




SAVI8 ; 


ALA 


E187: 


;n, 


CA,CB,C, 0 




5 


SAVI8: 


SER 


E188: 


N, 


CA,OG,CB,C,0 






SAVI8: 


SER 


E190: 


;N, 


CA,OG,CB,C,0 






S AVI 8 J 


GLN 


E191: 


:N, 


CA,NE2,0E1,CD,CG,CB,C,0 






SAVI8: 


TYR 


E192: 


:N, 


CA, OH, CZ , CD2 , CE2 , CE1 , GDI , CG , 


CB,C,0 




SAVI8: 


ALA 


E2 00: 


;N # 


CA,CB,C,0 




10 


SAVI8; 


>VAL 


E203 


;N, 


CA,CG2 , CG1,CB,C,0 






SAVI8: 


:ASN 


E204: 


:N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8: 


: GLN 


E206< 


tN, 


CA,NE2,0E1,CD,CG,CB,C,0 






SAVI8 


:GLY 


E211 


:N, 


CA,C,0 






SAVI8 


;SER 


E212 


:N, 


CA,0G,CB,C,0 




15 


SAVI8 


:THR 


E213 


;N, 


CA,CG2,0G1,CB,C,0 






SAVI8 


: ALA 


E215 


:N, 


CA,CB,C f 0 






SAVI8 


:SER 


E216 


:N ( 


CA,0G,CB,C,0 






SAVI8 


:VAL 


E227 


:N, 


CA,CG2,CG1,CB,C,0 






SAVI8 


: ALA 


E228 


:N, 


CA, CB,C,0 




20 


SAVI8 


: GLY 


E229 


:N, 


CA,C,0 






SAVI8 


: ALA 


E230 


:N, 


CA, CB, C,0 






SAVI8 


: THR 


E255 


:N, 


CA,CG2,0G1,CB,C,0 






SAVI8 


:SER 


E256 


;N, 


CA,0G,CB,C,0 






SAVI8 


:LEU 


E257 


:N, 


CA, CD2 , CD1 , CG, CB , C , 0 




25 


SAVI8 


:GLY 


E258 


:N, 


CA,C,0 






SAVI8 


:SER 


E259 


:N, 


CA,OG,CB,C,0 






SAVI8 


:ASN 


E261 


:N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8 


:LEU 


E262 


:N, 


CA , CD2 , CD1 , CG , CB , C , 0 






SAVI8 


:LEU 


E267 


:N, 


CA , CD2 , CD1 , CG , CB , C , 0 




30 


SAVI8 


:VAL 


E268 


:N, 


CA,CG2,CG1,CB,C,0 






SAVI8 


:ASN 


E269 




CA,ND2,0D1,CG,CB,C,0 





Subset SUB5B: 

sub5bn\ole. list 
Subset SUB5B: 

35 SAVI8 : E2-E4 , E16 , E19-E21 , E23-E24 , E28 , E37 , E41, E44-E45 , 
E77-E81 / E87-E88, 

SAVI8 : E90 , E113-E114 , E117-E118 , E12 0-E121 , E145- 
E148 / E169,E172 ,E174-E176, 
SAVI8:E193-E196,E198-E199,E214,E231- 
40 E234,E236,E243,E247,E250,E253-E254 £ 

SAVI8:E2 60, E263-E266 , E270-E273 ,M276H-M277H 

subSbatom. list 
Subset SUB5B: 

SAVI8:GLN E2 :N, CA,NE2,0El,CD,CG,CB,C,O 
45 SAVI8:SER E3 : N , CA , OG , CB , C , 0 

SAVI8:VAL E4 :N, CA, CG2 , CGI , CB, C, O 
SAVI8:ALA E16 :N, CA, CB, C,0 

SAVI8:ARG E 19 : N , CA , NH2 , NH1 , CZ , NE , CD , CG , CB , C , O 

SAVI8:GLY E20:N,CA,C,0 
50 SAVI8:LEU E2 1 : N , CA , CD2 , CD1 , CG , CB , C , O 

SAVI8:GLY E23:N,CA,C,0 

SAVI8:SER E24:N,CA,OG,CB,C,0 

SAVI8:VAL E28 :N, CA, CG2 , CGI, CB, C, 0 

SAVI8:SER E37:N,CA,OG,CB,C,0 
55 SAVI8:ASP E41:N,CA,OD2,ODl,CG,CB,C,0 

SAVI8:ILE E44:N,CA,CD1,CG1,CB,CG2,C,0 

SAVI8:ARG E45:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 



ASN 
SER 
ILE 
GLY 
VAL 
SER 
ALA 
LEU 
TRP 
ALA 
ASN 
GLY 
HIS 
VAL 
ARG 
GLY 
VAL 
LEU 
ALA 
ALA 
ALA 
MET 
ALA 
GLY 
ALA 
GLY 
LEU 
ILE 
VAL 
TYR 
ALA 
ALA 
LEU 
VAL 
GLN 
ASN 
ARG 
LEU 
THR 
ALA 
THR 
TYR 
GLY 
SER 
GLY 
ALA 
GLU 
ALA 
ALA 
ION 
ION 



E77:N,CA,ND2,0D1,CG,CB,C,0 

E78:N,CA,OG,CB,C,0 

E79:N,CA,CD1,CG1,CB,CG2,C,0 

E80:N,CA,C,O 

E81:N,CA,CG2,CG1,CB,C,0 

E87:N,CA,OG,CB,C,0 

E88:N,CA,CB,C,0 

E90:N,CA,CD2,CD1,CG,CB,C,0 

E113:N,CA,CD2,CE2,NE1,CD1,CG,CE3,CZ3,CH2,CZ2,CB,C,0 

E114:N,CA,CB,C,0 

E117:N,CA,ND2,0D1,CG,CB,C,0 

E118zN CA C O 

E120:n!cA^CD2,NE2,CE1 ,ND1,CG,CE,C,0 

E121:N,CA,CG2,C61,CE,C,0 

E145:N,CA,NH2,NH1,CZ,NE,CD, CG,C6,C,0 

E146:N, CA,C,0 

E147:N,CA, CG2 ,CG1,CB,C,0 

E148:N,CA, 002,001,06, 06,0,0 

E169:N,CA,CB,C,0 

E172:N,CA,CB,C,0 

E174:N,CA,CB,C,0 

E175:N,CA,CE, 80,06,06,0,0 

E176:N,CA,CB,C,0 

E193:N,CA,C,0 

E194:N / CA,CB,C,0 

E195:N,CA,C,0 

E196:N,CA, 002,001, 06,06,0,0 

E198:N,CA,CD1,CG1,C8,C62,C,0 

E199:N,CA,CG2,CG1,CB,C,0 

E2 1 4 : N , C A , OH , C Z , CD2 , CE2 , CE 1 , CD 1 , 06 , CB , C , O 

E231:N,CA,CB,C,0 

E232:N,CA,CB,C f O 

E23 3:N,CA,CD2,CD1,C6,CB,C,0 

E234:N,CA,C62,C61,CB,C,0 

E236:N,CA / NE2,0E1,CD,C6,CB,C,0 

E243:N,CA,ND2, 001,06,06, 0,0 

E247:N,CA,NH2,NHl,CZ,NE,CD / CG / CB,C / 0 

E250:N,CA, 002,001,06,06,0,0 

E253:N,CA,CG2,OGl,C8, 0,O 

E254:N,CA,C8,C,0 

E260:N,CA / CG2,OGl,C6 f 0,0 

E263:N,CA / 0H,CZ,CD2,CE2 / CE1,CD1,CG,C6,C,0 

E264:N,CA,C,0 

E265:N,CA, 06,06,0,0 

E266:N,CA,C,0 

E270:N,CA,C8,C,0 

E271:N,CA,0E2,0E1,CD,C0,C8,C,0 

E272:N,CA,C6,C,0 

E273:N,CA,CE,C,0 

M276H:CA 

M277H:CA 



Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

SAVI8 : E29-E35 , E48-E51 , E54 , E58-E72 , E91-E102 , E106-E107 , E110 , E123- 
E127, 
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SAVI8: E151-E155,E177-E179,E189, E201-E202 , E205 , E207-E2 10 , E217- 
E226 

actsiteatom. list 
5 Subset ACTSITE: 





SAVI8: 


ALA 


E29: 


N,CA,CB,C,0 






SAVI8 : 


VAL 


E30: 


N,CA,CG2,CG1,CB,C,0 






SAVI8: 


LEU 


E31: 


N , CA , CD2 , CD 1 , CG , CB , C , 0 






SAVI8: 


ASP 


E32: 


N,CA,0D2,ODl,CG,CB,C,O 




10 


SAVI8: 


THR 


E33: 


N,CA,CG2,0G1,CB,C,0 






SAVI8: 


GLY 


E34: 


N,CA,C,0 






SAVI8: 


ILE 


E35: 


N , CA , CD1 , CGI , CB , CG2 , C , 0 






SAVI8: 


ALA 


E48; 


N,CA,CB,C,Q 






SAVI8 : 


SER 


E49: 


N,CA, OG,CB,C,0 




15 


SAVI8: 


PHE 


E50: 


N , CA , CD2 , CE2 , CZ , CE1 , CD1 , CG , 


CB,C,0 




SAVI8 : 


VAL 


E5U 


N, CA, CG2 , CGI , CB , C , 0 






SAVI8 : 


GLU 


E54: 


,N, CA,OE2,OEl,CD,CG,CB,C,0 






SAVI8 : 


THR 


E58: 


;N,CA,CG2,0G1,CB,C,0 






SAVI8: 


GLN 


E59: 


;N,CA,NE2,0E1,CD,CG,CB, C f O 




20 


SAVI8 : 


ASP 


E60: 


; N , CA , OD2 , OD 1 , CG , CB , C , 0 






SAVI8 : 


GLY 


E6U 


:N,CA,C,0 






SAVI8J 


ASN 


E62: 


;N,CA,ND2,0D1,CG,CB,C,0 






SAVI8 : 


GLY 


E63: 


:N,CA,C,0 






SAVI8: 


HIS 


E64: 


;N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB, 


c,o 


25 


SAVI8 : 


GLY 


E65: 


:N,CA,C,0 






SAVI8: 


THR 


E66: 


;N,CA,CG2,0G1,CB,C,0 






SAVI8: 


HIS 


E67; 


:N,CA,CD2,NE2,CE1,ND1,CG,CB, 


C,0 




SAVI8 : 


VAL 


E68: 


: N , CA, CG2 , CGI , CB, C, 0 






SAVI8: 


ALA 


E69: 


:N,CA,CB,C,0 




30 


SAVI8: 


GLY 


E70: 


:N,CA,c,o 






SAVI8 : 


THR 


E71: 


:N,CA,CG2,0G1,CB,C,0 






SAVI8 J 


.ILE 


E72< 


:N,CA,CD1,CG1,CB,CG2,C,0 






SAVI8 : 


:TYR 


E91 


:N,CA,0H,CZ,CD2,CE2,CE1,CD1, 


CG , CB , C , O 




SAVI8; 


; ALA 


E92 


:N,CA,CB,C,0 




35 


SAVI8: 


:VAL 


E93 


:N,CA,CG2,CG1,CB,C,0 






SAVI8< 


:LYS 


E94 


:N,CA,NZ,CE,CD,CG,CB,C,0 






SAVI8 


:VAL 


E95 


:N,CA,CG2,CG1,CB,C,0 






SAVI8 


:LEU 


E96 


:N,CA,CD2,CD1,CG,CB,C,0 






SAVI8 


:GLY 


E97 


:N,CA,C,0 




40 


SAVI8 


: ALA 


E98 


:N,CA,CB,C,0 






SAVI8 


:SER 


E99 


:N,CA,OG,CB,C,0 






SAVI8 


:GLY 


E100:N,CA,C,O 






SAVI8 


:SER 


E101:N,CA,OG,CB,C,O 






SAVI8 


:GLY 


E102:N,CA,C,0 




45 


SAVI8 


:SER 


E106:N,CA,OG,CB,C,O 






SAVI8 


:ILE 


E107 : N , CA , CD1 , CGI , CB , CG2 , C , 0 






SAVI8 


:GLY 


E110:N,CA,C,0 






SAVI8 


:ASN 


E123:N,CA,ND2,0D1,CG, CB,C,0 






SAVI8 


:LEU 


E124:N,CA,CD2, CD1,CG, CB,C,0 




50 


SAVI8 


:SER 


E125:N,CA,OG,CB,C,0 






SAVI8 


:LEU 


E12 6 : N , CA , CD2 , CD1 , CG , CB , C , 0 






SAVI8 


:GLY 


E127:N,CA,C,0 






SAVI8 


: ALA 


E151:N,CA,CB,C,0 






SAVI8 


: ALA 


E152:N,CA,CB,C,0 




55 


SAVI8 


:SER 


E153:N,CA,OG,CB, C,0 






SAVI8 


: GLY 


E154:N,CA,C,0 






SAVI8 


:ASN 


E155 : N , CA , ND2 , OD1 , CG , CB , C , 0 
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SAVI8:VAL E177:N,CA,CG2,CG1,CB,C,0 
SAVI8:GLY E178 :N, CA, C,0 
SAVI8:ALA E179:N,CA,CB,C,0 

SAVI8:PHE E189 :N,CA / CD2 / CE2 ,CZ, CE1, 001,00,06,0,0 
5 SAVI8:PR0 E201:N,CD,CA / CG,CB,C,0 

SAVI8:GLY E202 :N,CA,C,0 

SAVI8:VAL E205:N,CA,CG2,CG1,CB,C,0 

SAVI8:SER E207 :N, CA,OG, CB, 0,0 

SAVI8:THR E208 : N, CA, CG2 , 0G1 , CB, C, O 
10 SAVI8:TYR E209:N,CA,OH,C2,CD2,CE2,CE1,CD1,CG,CB,C,0 

SAVI8 : PRO E2 10 : N , CD , CA , CG , CB , C , O 

SAVI8:LEU E217 : N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:ASN E218:N,CA,ND2,0D1,CG,CB,C,0 

SAVI8:GLY E219:N,CA,C,0 
15 SAVI8:THR E220 :N, CA, CG2 , 0G1 , CB, C, O 

SAVI8:SER E221 :N, CA, OG, CB, C,0 

SAVI8:MET E222:N,CA,0E,SD,CG,CB,C,O 

SAVI8:ALA E223 :N, CA, CB, 0,0 

SAVI 8 : THR E2 2 4 : N , CA , CG2 , 0G1 , CB , C , O 
20 SAVI8:PR0 E225:N,CD,CA,CG,CB,C,0 

SAVI8:HIS E226 : N, CA, CD2 , NE2 , CE1 ,ND1 , CG, CB, C, 0 
Subset RESTx: 

res txmole. list 
Subset RESTX: 
25 NEWMODEL: E5 , E13-E14 , E22 , E38-E40 , 

E42,E73-E76,E82-E86,E103-E105, 

NEWMODEL: E108 , E122 , E133-E135, E137-E140, 
E149-E150,E173,E204,E206, 

NEWMODEL: E211-E213 ,E215-E216,E227- E229, 
30 E258,E269 
restxatom. list 
Subset RESTX: 

NEWMODEL: PRO E5 :N, CD, CA, CG, CB, 0,0 

NEWMODEL : ALA E13 :N, CA, CB , C, O 
35 NEWMODEL : PRO E14 :N, CD, CA, CG, CB, C, 0 

NEWMODEL: THR E22 :N, CA, CG2 , 0G1 r CB, C, O 

NEWMODEL: THR E38 : N , CA, CG2 , 0G1 , CB, C , O 

NEWMODEL: HIS E39:N,CA,CD2,NE2, CE1 ,ND1 , CG, CB, C, O 

NEWMODEL: PRO E40:N,CD,CA,CG,CB,C,0 
40 NEWMODEL : LEU E42 :N, CA, CD2 , CD1 , CG, CB, C, O 

NEWMODEL : ALA E73 :N, CA, CB, 0,0 

NEWMODEL : ALA E74 :N, CA, CB, 0,0 

NEWMODEL: LEU E75 : N , CA , CD2 , CD1 , CG, CB , 0 , O 

NEWMODEL : ASN E7 6 : N , CA , ND2 , OD 1 , CG , CB , C , O 
45 NEWMODEL : LEU E8 2 : N , CA , CD2 , CD1 , CG , CB , C , O 

NEWMODEL : GLY E83:N,CA,C,0 

NEWMODEL: VAL E84 : N, CA, CG2 , CGI , CB, C, O 

NEWMODEL : ALA E85:N, CA, CB, 0,0 

NEWMODEL: PRO E86 :N, CD, CA, CG, CB, C, 0 
50 NEWMODEL: SER E103 : N, CA, OG, CB, 0,0 

NEWMODEL: VAL E104 : N , CA, CG2 , CGI , CB, C, O 

NEWMODEL : SER E105:N,CA,OG,CB,C,0 

NEWMODEL : ALA E108:N,CA,CB,C,O 

NEWMODEL : ALA E122:N,CA,CB,C,0 
55 NEWMODEL : ALA E133:N,CA,CB,C,0 

NEWMODEL: THR E134 : N , CA , CG2 , 0G1 , CB , C, O 

NEWMODEL: LEU E135 : N, CA, CD2 , CD1 , CG , CB , C , O 
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10 



15 



5 



NEWMODEL: GLN 
NEWMODEL : ALA 
NEWMODEL : VAL 
NEWMODEL :ASN 
NEWMODEL: VAL 
NEWMODEL: VAL 
NEWMODEL : ASN 
NEWMODEL : ASN 
NEWMODEL: GLN 
NEWMODEL : GLY 
NEWMODEL: SER 
NEWMODEL :THR 
NEWMODEL: ALA 
NEWMODEL: SER 
NEWMODEL: VAL 
NEWMODEL: ALA 
NEWMODEL: GLY 
NEWMODEL: GLY 
NEWMODEL: ASN 



E137:N,CA, NE2 , OE1 , CD , CG , CB , C , O 

E138:N, CA,CB,C,0 

E139:N,CA, CG2 ,CG1 ,CB ,C ,0 

E140:N,CA,ND2 ,001,00,06,0,0 

E149:N,CA,CG2,CG1,CB,C,0 

E150:N, CA, CG2 ,CG1,CB,C,0 

E173:N,CA,ND2,0D1,CG,CB,C,0 

E204:N,CA,ND2,ODl,CG,CB,C,O 

E206:N,CA,NE2,OE1,CD,CG,CB,C,0 

E211:N,CA,C,0 

E212:N,CA,OG,CB,C,0 

E213:N,CA,CG2,0G1,CB,C,0 

E215:N,CA,CB,C,0 

E216:N,CA,OG,CB,C,0 

E227:N,CA,CG2,CG1,CB,C,0 

E228:N,CA,CB,C,0 

E229:N,CA,C,0 

E258:N,CA,C,0 

E269:N,CA,ND2,0D1,CG,CB,C,0 



20 



Example 3 

Suitable substitutions in PD498 for addition of carboxylic acid 
attachment groups (-COOH) . 

The 3D structure of PD498 was modeled as described in 
25 Example 1. 

Suitable locations for addition of carboxylic attachment groups 
(Aspartatic acids and Glutamic acids) were found as follows. 
The procedure described in Example 1 was followed. The 
commands performed in Insight (BI0SYM) are shown in the command 
3 0 files makeDEzone.bcl and makeDEzone2 .bcl below: 



Conservative substutitions: 

makeDEzone • bcl 

Delete Subset * 
35 Color Molecule Atoms * Specified Specification 255,0,255 

Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

40 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERM : 280:0 Static monomer /residue 10 Color_Subset 
255,255,0 

#N0TE: editnextline ACTSITE residues according to the protein 
45 Zone Subset ACTSITE : 39,72,226 Static monomer/residue 8 

Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 

Combine Subset ALLZONE Union ALLZONE CTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 
50 #N0TE : editnextline object name according to the protein 

Combine Subset REST Difference PD498FINALMODEL ALLZONE 
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List Subset REST Atom Output File restatom. list 
List Subset REST monomer/ residue output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output File actsiteatom. list 
5 List Subset ACTSITE monomer/ residue Output_File 
actsitemole. list 
# 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 

10 Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255 , 255 , 255 
List Subset SUB5B Atom Output JFile sub5batom. list 
List Subset SUB5B monomer /residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 

15 ... 

#continue with makezone2.bcl. 

#Use grep command to identify asn/gln in restatom. list — 
#sub5batom. list & accsiteatom. list 

20 Comments: 

The subset REST contains Gln33 and Asn245, SUB5B contains 
Glnl2 f Glnl26, Asn209, Gln242, Asn246, Gln248 and Asn266, all 
of which are solvent exposed. 

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or 
25 Q126D, N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or 
N246E, Q248E or Q248D and N266D or N266E are identified in 
PD498 as sites for mutagenesis within the scope of this 
invention. Residues are substituted below in section 2, and 
further analysis done: 

30 

Non-conservative substitutions: 
makeDEzone2 .bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#N0TE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace PD4 9 8 FINALMODEL newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On PD4 9 8 FINALMODEL 

#NOTE : editnext lines with asn->asp & gln->glu positions 

Replace Residue newmodel: 33 glu L 
45 Replace Residue newmodel: 245 asp L 

Replace Residue newmodel: 12 glu L 

Replace Residue newmodel: 12 6 glu L 

Replace Residue newmodel: 209 asp L 

Replace Residue newmodel: 242 glu L 
50 Replace Residue newmodel: 246 asp L 

Replace Residue newmodel: 248 glu L 
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Replace Residue newmodel: 266 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, . . . 
#now including introduced asp & glu 
5 Color Molecule Atoms newmodel Specified Specification 255,0,255 
Zone Subset ASPx newmodel : asp rod* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel: glu :oe* Static monomer /residue 10 
Color_Subset 255,255,0 
10 #NOTE : editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel : 280 :0 Static monomer/residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
15 Zone Subset ACTSITEx newmodel: 39, 72 ,226 Static monomer/ residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
20 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output_File restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
25 List Subset ACTSITEx Atom Output^File actsitexatom. list 
List Subset ACTSITEx monomer/ residue OutputJFile 
acts itexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
30 (not_gluasp) ->gluasp . • . 
#subst. if needed 

Comments : 

The subset RESTx contains only two residues: A233 and G234, 

35 none of which are solvent exposed. No further mutagenesis is 

required to obtain complete protection of the surface. 

However, it may be necessary to remove some of the reactive 

carboxylic groups in the active site region to ensure access to 

the active site of PD498. Acidic residues within the subset 

40 ACTSITE are: D39, D58, D68 and D106. Of these only the two 

latter are solvent exposed and D39 is a functional residue. The 

mutations D68N, D68Q, D106N and D106Q were found suitable 

according to the present invention. 

Relevant data for Example 3: 

45 Solvent accessibility data for PD498MODEL: see Example 1 above. 

Subset REST: 

restmole. list 
Subset REST: 

PD498FINALMODEL: 10-11, 33-35, 54-55 ,129-130, 
50 221,233-234,236,240,243, 

PD4 9 8 F INALMODEL : 245 , 262 ,264-265 
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restatom. list 
Subset REST: 

PD4 9 8 FINALMODEL : ALA 10 :N, CA, C,0, CB 
5 PD4 9 8 FINALMODEL :TYR 11 : N, CA, C, 0, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ, OH 
PD498 FINALMODEL : GLN 3 3 : N , CA , C , O , CB , CG , CD , OE1 , NE2 
PD4 9 8 FINALMODEL : THR 3 4 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL :VAL 35 :N, CA, C, O, CB, CGI, CG2 
PD4 9 8 FINALMODEL : ILE 54 : N , CA , C , O , CB , CGI , CG2 , CD1 
10 PD4 9 8 FINALMODEL : LYS 55:N,CA,C,0,CB,CG,CD,CE,NZ 
PD4 9 8 FINALMODEL :LYS 129:N,CA,C,0,CB,CG,CD,CE,NZ 
PD 4 9 8 FI N ALMOD EL : VAL 130:N,CA,C,O,CB,CGl,CG2 

PD4 9 8 FINALMODEL :TYR 221 :N, CA, C,0, CB, CG, CD1, CD2 , CE1 , CE2 , CZ ,OH 

PD4 9 8 FINALMODEL: ALA 233 :N, CA, C f O, CB 
15 PD498FINALMODEL:GLY 234:N,CA,C,0 

PD498 F INALMODEL : ALA 2 3 6 : N , CA f C , O , CB 

PD4 9 8 FINALMODEL: ALA 240 : N , CA, C,0, CB 

PD4 9 8 FINALMODEL :GLY 243:N,CA,C,0 

PD4 9 8 FINALMODEL : ASN 2 4 5 : N , CA , C , O , CB , CG , ODl , ND2 
20 PD4 9 8 FINALMODEL :GLY 262:N,CA,C,0 

PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 

PD4 9 8 F I N ALMODEL : THR 2 6 5 : N , CA , C , O , CB , OG1 , CG2 
Subset SUB5B: 
subSbmole. list 
25 Subset SUB5B: 

PD4 9 8 FINALMODEL: 6-9 , 12-13 ,31-32 , 51-53 , 56,81, 93-94 , 97- 

99,122,126-128, 

PD4 9 8 FINALMODEL: 131 , 155-157 , 159 , 197-199 , 209 , 211 , 219- 
220,232,235, 

30 PD4 9 8 FINALMODEL: 237-239, 241-242, 244, 246-249, 253,260- 

261,263,266-268 
subSbatom. list 

Subset SUB5B: 

PD4 9 8 FINALMODEL: PRO 6 : N , CA, CD , C, 0 , CB, CG 
35 PD4 9 8FINALMODEL : TYR 7 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8FINALMODEL : TYR 8:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2 ,CZ,OH 

P D 4 9 8 F I N ALMO DEL : S ER 9 : N , C A , C , O , CB , OG 

PD4 9 8 F INALMODEL : GLN 12 :N, CA,C,0,CB,CG, CD,0E1,NE2 

PD4 9 8 FINALMODEL : TYR 13 : N, CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
40 PD4 9 8 FINALMODEL :SER 31:N, CA, C,0, CB,.OG 

PD4 98FINALMODEL : THR 3 2 : N , CA , C , O , CB , OGl , CG2 

PD4 9 8FINALMODEL : ARG 51 :N, CA, C,0, CB, CG, CD,NE, CZ,NH1,NH2 

PD4 9 8 FINALMODEL : LYS 5 2 : N , CA , C , O , CB , CG , CD , CE , NZ 

PD4 9 8 FINALMODEL : VAL 5 3 : N , CA , C , 0 , CB , CGI , CG2 
45 PD4 9 8 FINALMODEL :GLY 56:N,CA,C,0 

PD4 9 8 FINALMODEL: ALA 81:N,CA,C,0,CB 

PD4 9 8 FINALMODEL : MET 9 3 : N , CA , C , O , CB , CG , SD , CE 

PD4 9 8 FINALMODEL : ALA 9 4 : N , CA , C , O , CB 

PD4 9 8 FINALMODEL : THR 97 : N , CA , C , 0 , CB , OGl , CG2 
50 PD4 9 8 FINALMODEL: LYS 98 : N, CA, C,0, CB, CG, CD, CE, NZ 

PD4 9 8 FINALMODEL : ILE 99 : N , CA , C , O , CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL : TYR 12 2 : N , CA, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL: GLN 126 :N, CA, C, 0, CB, CG, CD,0E1,NE2 

PD4 9 8 FINALMODEL : GLY 127:N,CA,C,0 
55 PD498FINALMODEL: ALA 128 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL : LEU 13 1 : N , CA, C, O , CB , CG , CD1 , CD2 

PD4 9 8 FINALMODEL : GLY 155:N,CA,C,0 
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PD4 9 8 FINALMODEL : 
PD4 9 8 FI NALMODEL : 
PD 4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
5 PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
10 PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 98 FINALMODEL : 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
15 PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
PD498 FI NALMODEL : 
20 PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL : 
PD4 9 8 FINALMODEL: 
25 PD4 9 8 FINALMODEL : 
PD498FINALMODEL: 
PD4 9 8 FINALMODEL : 
PD498 FINALMODEL : 
PD4 9 8 FINALMODEL: 
3 0 Subset ACTSITE: 

actsitemole . list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL: 36-42,57-60, 66-80,100-110, 
115-116 , 119 , 132-136 , 160-164 , 
35 PD4 9 8 FINALMODEL: 182-184 , 194 , 206-2 07 , 210 , 

212-215,222-231 
actsiteatom. list 
Subset ACTSITE: 

PD4 9 8 FINALMODEL : ALA 36:N,CA,C,0,CB 
40 PD4 9 8 FINALMODEL :VAL 37 :N, CA, C, 0, CB, CGI , CG2 

PD4 9 8 FINALMODEL: LEU 3 8 : N, CA, C , 0 , CB, CG, CD1 , CD 2 
PD4 9 8 FINALMODEL: ASP 39 :N, CA, C, 0, CB, CG,ODl , OD2 
PD4 9 8 FINALMODEL : SER 4 0 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL :GLY 41:N,CA,C,0 
45 PD4 9 8 FINALMODEL :VAL 42 :N, CA, C, 0, CB, CGI , CG2 

PD498 FINALMODEL : TYR 

5 7 : N , CA , C , O , CB , CG , CD 1 , CD2 , CE1 , CE2 , CZ,OH 
PD4 9 8 FINALMODEL : ASP 5 8 : N , CA , C , 0 , CB , CG , OD 1 , OD2 
PD4 9 8 FINALMODEL : PHE 
50 59:N,CA,C,0,CB,CG, CD1 , CD2 ,CE1,CE2 ,CZ 

PD4 9 8 FINALMODEL : ILE 6 0 : N , CA , C , O , CB , CG 1 , CG2 , CD 1 
PD49 8 FINALMODEL : PRO 6 6 : N , CA , CD , C , 0 , CB , CG 
PD4 9 8 FINALMODEL : MET 67 : N , CA , C , O , CB , CG , SD , CE 
PD4 9 8 FINALMODEL : ASP 6 8 : N , CA , C , 0 , CB , CG , OD1 , OD2 
55 PD4 9 8 FINALMODEL: LEU 6 9 : N , CA , C , 0 , CB , CG , CD 1 , CD 2 

PD4 9 8 FINALMODEL :ASN 7 0 : N , CA , C , 0 , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL :GLY 71:N,CA,C,0 



ALA 


156 


:N, 


C A ,C,0 , 


CB 










VAL 


157 


:N, 


CA,C,0, 


CB, 


CGI , CG2 








VAL 


159: 


>N, 


CA,C,0, 


CB, 


CGI , CG2 








TYR 


197: 


:N, 


CA,C,0, 


CB, 


CG,CD1,CD2,CE1 


,CE2 


,CZ 


,OH 


GLY 


198: 


:N, 


CA,C,0 












THR 


199 


:N, 


CA,C,0, 


CB, 


0G1,CG2 








ASN 


209 


:N, 


CA,C,0, 


CB, 


CG,0D1,ND2 








ALA 


211: 


:N, 


CA,C,0, 


CB 










TYR 


219: 


:N, 


CA,C,0, 


CB, 


CG,CD1,CD2,CE1 


,CE2 


,CZ 


,OH 


SER 


220: 


:N, 


CA,C,0, 


CB, 


OG 








VAL 


232: 


:N, 


CA,C,0, 


CB, 


CG1,CG2 








LEU 


235 


:N, 


CA,C,0, 


CB, 


CG,CD1,CD2 








ALA 


237 


:N, 


CA,C,0, 


CB 










LEU 


238: 


•N, 


CA, C,0, 


CB, 


CG,CD1,CD2 








LEU 


239: 


:N, 


CA,C,0, 


CB, 


CG,CD1,CD2 








SER 


241 


;N, 


C A , C , 0 , 


CB, 


OG 








GLN 


242 


!N, 


CA,C,0, 


CB, 


CG,CD,0E1,NE2 








LYS 


244: 


:N, 


CA,C,0, 


CB, 


CG,CD,CE,NZ 








ASN 


246: 


:N, 


CA,C,0, 


CB, 


CG,0D1,ND2 








VAL 


247 


'N, 


C A , C , 0 , 


CB, 


CGI , CG2 








GLN 


248 


!N, 


C A , C , 0 , 


CB, 


CG,CD,OEl,NE2 








ILE 


249 


:N r 


CA,C,0, 


CB, 


CG1,CG2,CD1 








ILE 


253, 


:N, 


CA,C,0, 


CB, 


CG1,CG2,CD1 








ILE 


260 


!N, 


CA,C,0, 


CB, 


CG1,CG2,CD1 








SER 


261 


;N, 


CA,C,0, 


CB, 


OG 








THR 


263 


:N, 


C A , C , 0 , 


CB, 


0G1,CG2 








ASN 


266 


:N, 


CA , C , 0 , 


CB, 


CG,0D1,ND2 








PHE 


267 


;N, 


CA , C , 0 , 


CB, 


CG,CD1,CD2,CE1 


,CE2 


,cz 




LYS 


268, 


:N, 


CA ,C,0, 


CB, 


CG, CD, CE,NZ 
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PD4 9 8 FINALMODEL : HI S 
PD4 9 8 FINALMODEL : GLY 
PD4 9 8 FINALMODEL : THR 
PD498 FINALMODEL : HI S 
5 PD4 9 8 FINALMODEL : VAL 

PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL: GLY 
PD4 9 8 FINALMODEL : THR 
PD4 98 FINALMODEL : VAL 

10 PD498 FINALMODEL : LEU 

PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : VAL 
PD4 9 8 FINALMODEL : ARG 
CG,CD,NE,CZ,NH1 

1 5 PD4 9 8 FINALMODEL : VAL 

PD4 9 8 FINALMODEL : LEU 
PD4 9 8 FINALMODEL : ASP 
PD498 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : ASN 

2 0 PD4 9 8 FINALMODEL : GLY 

PD4 9 8 FINALMODEL : SER 
PD4 9 8 FINALMODEL: SER 
PD498FINALMODEL: ILE 
061,002,001 

25 PD498 FINALMODEL : GLY 

PD4 9 8 FINALMODEL : ASN 
PD4 9 8 FINALMODEL : LEU 
PD498FINALM0DEL: SER 
PD4 9 8 F I N ALMODEL : LEU 

30 PD498 FINALMODEL : GLY 

PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : GLY 

35 PD498 FINALMODEL : ASN 

PD4 9 8 FINALMODEL : VAL 
PD498 FINALMODEL : GLY 
PD4 98 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : PHE 

40 00,001,002, CE1, 

PD4 9 8 FINALMODEL : PRO 
PD4 9 8 FINALMODEL : GLY 
PD498FINALMODEL: ILE 
CGI r CG2 ff CD-I 

4 5 PD4 9 8 FINALMODEL : SER 

PD4 9 8 FINALMODEL : THR 
PD4 9 8 FINALMODEL : VAL 
PD498 FINALMODEL : PRO 
PD4 ? 8 FINALMODEL : MET 

50 PD498 FINALMODEL : SER 

PD4 9 8 FINALMODEL : GLY 
PD4 9 8 FINALMODEL : THR 
PD498 FINALMODEL : SER 
PD498 FINALMODEL : MET 

55 PD498 FINALMODEL : ALA 

PD498FINALMODEL : SER 
PD4 9 8FINALM0DEL : PRO 



72:N / CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
73:N,CA,C,0 

74:N,CA,C,0,CB,OG1,CG2 

75:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

7 6 : N , OA , C , 0 , CB , CG 1 , CG2 

77:N,CA,C,0,CB 

78:N,CA,C,0 

79:N,CA,C,0,CB,OGl,CG2 
80:N,CA,C,O,CB,CGl,CG2 



100:N, 


OA, 


C,0,CB,CG,CD1,CD2 


101:N, 


OA, 


C,0,CB 


102:N, 


OA, 


C,0,CB,CG1,CG2 


103:N, 


CA, 


0,0, CB, 


,NH2 






104:N, 


OA, 


C,0,CB,CG1,CG2 


105:N, 


OA, 


0,0,08,00,001,002 


106:N, 


OA, 


C,0,CB,CG,ODl,OD2 


107:N, 


CA, 


C,0,CB 


108:N, 


CA, 


C,0,CB,CG,0D1,ND2 


109:N, 


CA, 


C,0 


110:N, 


CA, 


C,0,CB,OG 


115:N, 


CA, 


C,0,CB,OG 


116:N, 


CA, 


C,0,CB, 


119:N, 


CA, 


C,0 


132:N, 


CA, 


C,0,CB,CG,0D1,ND2 


133 : N 


CA, 


C . O . CB . CG . CD1 . CD2 


134:N, 


CA, 


C,0,CB,OG 


135:N, 


CA, 


C,O,CB,0G,CDl,CD2 


136:N, 


CA, 


C,0 


160:N, 


CA, 


0,0, CB 


161:N, 


CA, 


C,0,CB 


162:N, 


CA # 


C,0,CB 


163:N, 


CA, 


C,0 


164:N, 


CA, 


C, 0, CB , CG , 0D1 , ND2 


182:N, 


CA, 


C,0,CB,CG1,CG2 


183 :N, 


CA, 


C,0 


184:N, 


CA, 


0,0, CB 


194:N, 


CA, 


C,0,CB, 


CE2,CZ 






206:N, 


CA, 


CD , C , 0 , CB , CG 


207:N, 


CA, 


C,0 


210:N, 


CA, 


C,0,CB, 


212:N, 


CA, 


C,0,CB,OG 


213:N, 


CA, 


C,0,CB,0G1,CG2 


214:N, 


CA, 


C,0,CB,CG1,CG2 


215:N, 


CA, 


CD,C,0,CB,CG 


222:N, 


CA, 


C,0,CB,CG,SD,OE 


223:N, 


CA, 


C,0,CB,OG 


224:N, 


CA, 


,0,0 


225:N, 


CA, 


,C,0,CB,0G1,CG2 


226:N, 


CA, 


C,0,CB,OG 


227:N, 


CA, 


r C,0,CB,CG,SD,CE 


228:N, 


CA, 


,C,0,CB 


229:N, 


,CA, 


,C,0,CB,OG 


230:N, 


CA, 


,CD,C,0,CB,CG 
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PD4 9 8 FINALMODEL : HIS 231:N, CA, C,0,CB, 
CG , ND1 , CD2 , CE1 , NE2 
Subset RESTx: 

restxmole . list 
5 Subset RESTX: 

NEWMODEL: 23 3-234 
restxatom. list 
Subset RESTX: 

NEWMODEL : ALA 233 : N, CA, C, O, CB 
10 NEWMODEL : GLY 234:N,CA,C,0 

Example 4 

suitable substitutions in the Arthromyces ramosus peroxidase 
for addition of carboxylic acid attachment gr oups r-COOH) 
15 Suitable locations for addition of carboxylic attachment 
groups (Aspartatic acids and Glutamic acids) in a non- 
hydrolytic enzyme, Arthromyces ramosus peroxidase were found as 
follows. 

The 3D structure of this oxido-reductase is available in the 
20 Brookhaven Databank as larp.pdb. This A. ramosus peroxidase 
contains 344 amino acid residues. The first eight residues are 
not visible in the X-ray structure: QGPGGGGG, and N143 is 
glycosylated . 

The procedure described in Example 1 was followed. 
25 The amino acid sequence of Arthromyces ramosus Peroxidase 

(E.C.I. 11. 1.7) is shown in SEQ ID NO 4. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeDEzone. bcl and makeDEzone2 .bcl below. The C- 

terminal residue is P344, the ACTSITE is defined as the heme 
30 group and the two histidines coordinating it (H56 & H184) . 

Conservative substitutions: 

makeDEzone. bcl 

Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
35 Zone Subset ASP :asp:od* Static monomer/residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline C-terminal residue number according to the 
40 protein 

Zone Subset CTERM : 344:0 Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :HEM,56,184 Static monomer /residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 
Combine Subset ALLZONE Union ALLZONE CTERM 



WO 98/35026 



74 



PCT/DK98/00046 



Combine Subset ALLZONE Union ALL ZONE ACTSITE 
#NOTE: editnextline object name according to the protein 
Combine Subset REST Difference ARP ALLZONE 
List Subset REST Atom Output File restatom. list 
5 List Subset REST monomer/ residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output^File act siteatom. list 
List Subset ACTSITE monomer /residue Output_File 
actsitemole. list 
10 # 

Zone Subset REST5A REST Static Monomer /Residue 5 -Color_Subset 
Combine Subset SUB 5 A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
15 List Subset SUB5B Atom Output File subSbatom. list 

List Subset SUB5B monomer /residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 
* • • 

#continue with makezone2.bcl. 
20 #Use grep command to identify asn/gln in restatom. list ... 
#sub5batom. list & accsiteatom. list 

Comments : 

The subset REST contains Gln70, and SUB5B contains Gln34, 
25 Asnl28, Asn303 all of which are solvent exposed. The 

substitutions Q34E or Q34D, Q70E or Q70D, N128D or N128E and 
N303D or N303E are identified in A. ramosus peroxidase as sites 
for mutagenesis. Residues are substituted below and further 
analysis done: 

30 

Non-conservative substitutions: 
makeDEzone2 • bcl 

#sourcefile makezone2.bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#N0TE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace ARP newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On ARP 

#NOTE: editnextlines with asn->asp & gln->glu positions 
Replace Residue newmodel: 34 glu L 
45 Replace Residue newmodel: 70 glu L 
Replace Residue newmodel: 128 asp L 
Replace Residue newmodel: 303 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, ... 
50 #now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
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Zone Subset ASPx newmodel: asp :od* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel:glu:oe* Static monomer/ residue 10 
Color_Subset 255,255,0 
5 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel : 344 :0 Static monomer /residue 10 
Color_Subset 255,255,0 

#N0TE : editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel: HEM, 56, 184 Static monomer /residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
20 List Subset ACTSITEx Atom Output^File acts itexa torn. list 
List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 
# 

#read restxatom, list or restxmole, list to identify sites for 
25 (not_gluasp) ->gluasp . . . 
#subst. if needed 

Comments : 

The subset RESTx contains only four residues: S9, S334, G335 

30 and P336, all of which are >5% solvent exposed. The mutations 
S9D, S9E, S334D, S334E, G335D, G335E, P336D and P336E are 
proposed in A. ramosus peroxidase. Acidic residues within the 
subset ACTSITE are: E44 , D57, D77, E87, E176, D179, E190, D202, 
D209, D24 6 and the N-terminal carboxylic acid on P344. Of these 

35 only E44, D77, E176, D179, E190, D209, D246 and the N-terminal 
carboxylic acid on P344 are solvent exposed. Suitable sites for 
mutations are E44Q, D77N, E176Q, D179N, E190Q, D209N and D246N. 
D246N and D246E are risky mutations due to D246's importance 
for binding of heme, 

40 The N-terminal 8 residues were not included in the 

calculations above, as they do not appear in the structure. 
None of these 8 residues, QGPGGGG, contain carboxylic groups. 
The following variants are proposed as possible mutations to 
enable attachment to this region: Q1E, Q1D, G2E, G2D, P3E, P3D, 

45 G4E, G4D, G5E, G5D, G6E, G6D, G7E, G7D, G8E, G8D. 
Relevant data for Example 4: 
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Solvent accessibility data for A. ramosus peroxidase (Note: 
as the first eight residues are missing in the X-ray structure, 
the residue numbers printed in the accessibility list below are 
8 lower than those used elsewhere for residue numbering. 



5 


# ARP 


Thu Jan 30 15:39:05 MET 1997 




# residue 


area 




SER 1 


143.698257 




VAL_2 


54.879990 




THR 3 


86.932701 


10 


CYS 4 


8.303715 




PRO 5 


126.854782 




GLY 6 


53.771488 




GLY 7 


48.137802 




GLN 8 


62.288475 


15 


SER 9 


79.932549 




THR 10 


16.299215 




SER 11 


81.928642 




ASN 12 


51.432678 




SER 13 


81.993019 


20 


GLN 14 


92.344009 




CYS 15 


0.000000 




CYS 16 


32.317432 




VAL 17 


54. 067810 




TRP 18 


6.451035 


25 


PHE 19 


25.852070 




ASP 20 


79.033997 




VAL 21 


0.268693 




LEU 22 


22.032858 




ASP 23 


90.111404 


30 


ASP 24 


43.993240 




LEU 25 


1.074774 




GLN 26 


25.589321 




THR 27 


82.698059 




ASN 28 


96.600883 


35 


PHE 29 


32.375275 




TYR 30 


5.898365 




GLN 31 


103.380585 




GLY 32 


40.042034 




SER 33 


46.789322 


40 


LYS 34 


87.161873 




CYS 35 


12.827215 




GLU 36 


51.582657 




SER 37 


16.378180 




PRO 38 


33.560043 


45 


VAL 39 


6.448641 




ARG 40 


7.068311 




LYS 41 


15.291286 




ILE 42 


1.612160 




LEU 43 


1.880854 


50 


ARG 44 


16.906845 




ILE 45 


0.000000 




VAL 46 


2.312647 




PHE 47 


2.955627 




HIS 48 


20.392527 


55 


ASP 49 


4.238116 
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ALA 


50 


0.510757 




ILE 


51 


1.576962 




GLY 


52 


2.858601 




PHE 


"53 


48. 633503 


5 


SER 


"54 


8.973248 




PRO 


55 


58.822315 




ALA" 


56 


59.782852 




LEU" 


"57 


46.483955 




thr" 


58 


86.744827 


10 


ALA 


"59 


89.515816 




ALA 


60 


81.163239 




GLY" 


"61 


70. 119019 




GLN" 


'62 


112.635498 




phe" 


"63 


93.522354 


15 


GLY" 


"64 


2.742587 




GLY~ 


"65 


13.379636 




GLY~ 


"66 


22.722847 




GLY" 


'67 


0.000000 




ALA" 


"68 


0.268693 


20 


asp" 


"69 


12.074840 




GLY 


"70 


0.700486 




SER~ 


'71 


0.000000 




ILE" 


"72 


0.000000 




ILE~ 


"73 


0.000000 


25 


ALA" 


"74 


17.304443 




his" 


"75 


41.071186 




ser" 


"76 


20.000793 




asn" 


"77 


120.855316 




ile" 


"78 


66. 574982 


30 


GLU" 


"79 


2.334954 




LEU" 


"80 


41.329689 




ALA" 


81 


77.370575 




PHE 


"82 


38.758774 




PRO" 


"83 


131.946289 


35 


ALA" 


"84 


34.893864 




ASN" 


"85 


5.457000 




GLY" 


"86 


43.364151 




GLY" 


"87 


51.561348 




LEU* 


"88 


0.242063 


40 


THR" 


"89 


73.343575 




asp" 


"90 


130.139389 




THR" 


"91 


17.863211 




ILE" 


"92 


0.268693 




GLU" 


"93 


92.210396 


45 


ALA" 


"94 


35.445068 




LEU" 


"95 


1.343467 




arg" 


"96 


31.175611 




ALA" 


"97 


44. 650192 




VAL" 


"98 


17. 698566 


50 


GLY" 


"99 


1.471369 




ILE" 


"100 


62.441463 




ASN" 


"101 


107.139748 




HIS" 


"102 


46.952496 




GLY" 


"103 


46.559296 


55 


VAL" 


"104 


11.342628 




ser" 


"105 


15.225677 




PHE" 


"106 


6.422011 
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GLY 


107 


3.426864 




ASP 


108 


10.740790 




LEU 


109 


0.268693 




ILE 


110 


1.880854 


5 


GLN 


111 


31.867456 




PHE 


112 


0.000000 




ALA 


113 


0.000000 




THR 


114 


3. 656114 




ALA 


115 


8.299393 


10 


val" 


116 


0.268693 




gly" 


117 


0.268693 




MET_ 


118 


3.761708 




SER 


119 


14.536770 




ASN 120 


25.928799 


15 


CYS 


121 


0.537387 




pro" 


122 


29.798336 




gly 


123 


33.080013 




SER 


124 


17.115562 




pro" 


'125 


36.908714 


20 


ARG 


'126 


108. 274727 




LEU 


"127 


21.238588 




GLU 


'128 


53 .742313 




PHE~ 


'129 


3.761708 




leu" 


'130 


12.928699 


25 


thr" 


131 


10.414591 




GLY" 


"132 


47.266495 




ARG" 


'133 


12.247048 




SER" 


"134 


63.047237 




ASN" 


"135 


31.403708 


30 


SER" 


"136 


97.999619 




SER~ 


"137 


28.505201 




GLN" 


"138 


102.845520 




pro" 


"139 


49.691917 




ser" 


"140 


9.423104 


35 


pro" 


"141 


25.724171 




pro" 


"142 


80.706665 




ser" 


"143 


105.318176 




leu" 


"144 


20.154398 




ile" 


"145 


41.288322 


40 


pro" 


"146 


10.462679 




gly" 


'147 


19.803421 




pro" 


"148 


18.130360 




gly" 


"149 


47.391853 




ASN 


"150 


60.248917 


45 


THR" 


"151 


87.887985 




VAL* 


"152 


13.870322 




THR" 


"153 


74.664734 




ALA" 


"154 


45.251106 




ILE' 


"155 


2.686934 


50 


LEU 


"156 


28.720940 




asp' 


"157 


110.081253 




arg" 


"158 


31.228874 




MET 


"159 


1.612160 




GLY" 


160 


38.223858 


55 


asp" 


"161 


46.293152 




ALA' 


162 


9.877204 




GLY' 


"163 


34.267326 
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PHE 


164 


11,057570 




SER 165 


51.158882 




PRO 


166 


62.767738 




ASP 


167 


75.164917 


5 


GLU 


168 


43.334976 




VAL 


169 


6.365355 




VAL 


170 


2.955627 




ASP 


171 


7.004863 




LEU~ 


172 


1.880854 


10 


LEU"" 


173 


3.197691 




ALA 


174 


0.000000 




ALA"" 


175 


1.074774 




HIS~ 


176 


0.502189 




SER 


177 


0.806080 


15 


LEU 


178 


3.197691 




ALA 


179 


3.337480 




SER" 


180 


0.466991 




GLN 


"181 


2. 122917 




GLU~ 


182 


40.996552 


20 


GLY 


183 


62.098671 




LEU 


184 


23.954853 




ASN 


185 


15.918136 




SER 


186 


95.185318 




ALA" 


187 


59.075272 


25 


ILE 


188 


27.675419 




PHE 


189 


102.799423 




arg" 


190 


55.265549 




SER 


191 


6.986028 




PRO" 


'192 


2.686934 


30 


LEU 


193 


12.321225 




ASP 


"l94 


2.127163 




SER 


'195 


33.556419 




THR 


"196 


33.049286 




pro" 


"197 


20.874798 


35 


gln" 


"198 


65.729698 




val" 


"199 


31.705818 




PHE 


"200 


4.753195 




ASP" 


"201 


13.744506 




thr" 


"202 


1.612160 


40 


gln" 


*203 


16.081930 




phe" 


"204 


2.581340 




tyr" 


"205 


1.880854 




ile" 


"206 


9.356181 




GLU" 


"207 


0.735684 


45 


thr" 


"208 


10.685907 




leu" 


"209 


9.672962 




leu" 


[210 


2.955627 




T V C 


Oil 

1 1 1 


77.176834 




GLY~ 


"212 


40.968609 


50 


thr" 


"213 


•"TO ""7 *1 OT 1 C 

78 . 718216 




thr" 


"214 


21.738384 




gln" 


"215 


77.622299 




pro" 


"216 


25.441587 




GLY" 


"217 


8.320850 


55 


PRO" 


"218 


96.972305 




SER" 


"219 


64.627823 




LEU" 


"220 


85.732414 
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GLY 


221 


27.361111 




PHE~ 


222 


134.620178 




ALA 


223 


3.873014 




GLU 


'224 


12.141763 


5 


GLU~ 


225 


65.129868 




LEU 


226 


76.105843 




SER 


'227 


0.268693 




PRO" 


'228 


7.017754 




PHE 


"229 


0.000000 


10 


PRO 


230 


47.827423 




GLY 


231 


23.790522 




GLU*" 


"232 


6.643466 




PHE 


233 


6.713862 




ARG 


"234 


18,012030 


15 


MET" 


235 


4.598188 




ARG 


236 


91.415581 




SER" 


"2 37 


1.982125 




ASP" 


"238 


6.246871 




ALA" 


"239 


12.897283 


20 


leu" 


"240 


76.820526 




leu" 


"241 


3.224321 




ALA 242 


1.400973 




ARG 243 


77.207176 




ASP 


244 


36.207306 


25 


SER" 


"245 


104.023796 




arg" 


"246 


121.852341 




thr" 


"247 


2.955627 




ALA" 


"248 


4.810700 




CYS 


"249 


47.331306 


30 


ARG" 


"250 


62.062778 




TRP" 


"251 


2.418241 




GLN 


"252 


5.554953 




SER" 


"253 


38.284832 




met" 


"254 


1.124224 


35 


thr" 


"255 


0.000000 




ser" 


"256 


53.758987 




ser" 


"257 


37.276134 




ASN 258 


44.381340 




GLU 259 


149.565140 


40 


VAL 


260 


57.500389 




met" 


"261 


2.679314 




GLY" 


"2 62 


10.175152 




GLN" 


"263 


107.458916 




ARG** 


"264 


36.402130 


45 


TYR- 


"265 


0.233495 




arg" 


"266 


91.179619 




ALA" 


"267 


53.708500 




ALA" 


"268 


6.504294 




met" 


"269 


17.122011 


50 


ALA" 


"270 


22.455158 




LYS" 


"271 


73.386177 




MET" 


"272 


3.959508 




ser" 


"273 


15.043281 




val" 


"274 


23.887930 


55 


LEU 


"275 


17.196379 




GLY*" 


276 


44.362202 




PHE' 


277 


68.062485 
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ASP 


278 


94.902039 




ARG 


279 


113.549011 




ASN~ 


280 


134.886017 




ALA- 


281 


72.340973 


5 


LEU 


282 


26.692348 




THR~" 


283 


27.696728 




ASP 


'284 


72.214157 




CYS~ 


285 


0.000000 




SER 


"286 


28.209335 


10 


ASP 


"287 


64.560753 




VAL~ 


288 


7.040061 




ILE 


"289 


8.665112 




PRO" 


"290 


48.682365 




SER 


291 


86.141670 


15 


ALA 


292 


29.031240 




VAL" 


293 


84.432014 




SER 


294 


85.944153 




ASN 


"295 


49.017288 




ASN~ 


"296 


133.459198 


20 


ALA 


*297 


57.283794 




ALA 


"298 


65.233749 




PRO 


"299 


24.751518 




VAL 


300 


45. 409184 




ILE 


301 


8.060802 


25 


PRO 


"302 


14.742939 




GLY 


"303 


16.589832 




GLY 


"304 


34.238071 




LEU 


"305 


24.719791 




THR 


"306 


49.356300 


30 


VAL 


"307 


71.491821 




ASP" 


"308 


130.906174 




ASP 


"309 


31.733070 




ILE*" 


"310 


19.581894 




GLU" 


"311 


81.414574 


35 


VAL" 


"312 


94.769890 




SER~ 


"313 


39.688896 




CYS~ 


"314 


9.998511 




pro" 


"315 


120.328018 




SER*" 


"316 


95.364319 


40 


GLU" 


"317 


65.560959 




PRO" 


'318 


100.254364 




PHE~ 


'319 


46.284115 




PRO" 


"320 


31.328060 




GLU" 


"321 


177.602249 


45 


ILE" 


"322 


33.449741 




ALA" 


"323 


46.892982 




THR" 


"324 


79.976471 




ALA" 


"325 


36.423820 




SER" 


"326 


124.467422 


50 


gly" 


"327 


28.219524 




PRO 


"328 


107.553696 




leu" 


"329 


86.789825 




pro"] 


"330 


34.287163 




ser" 


"331 


75.764053 


55 


leu" 


"332 


32.840569 




ALA" 


"333 


61.516434 




PRO" 


"334 


82.389992 
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ALAJ335 

PRO_336 

HEM_337 

CA_338 

CA_339 

NAG_340 

NAG 341 



10 



15 



20 



25 



6.246871 
56.750813 
60.435017 
2.078997 
0.000000 
141.534668 
186.311371 
Subset REST: 

restmole. list 
Subset REST: 

ARP: 9 ,69-70 ,125 ,127 ,133,299-301 ,334-336 
restatom. list 
Subset REST: 

ARP : SER 9:N,CA,C,0,CB,OG 
ARP : GLY 69:N,CA,C f O 

ARP : GLN 70:N,CA,C,O,CB,CG,CD,OEl,NE2 
ARP : GLY 125:N / CA,C,0 
ARP : SER 127:N, CA,C,0, CB,OG 
ARP:PRO 133:N,CA,CD,C,0,CB,CG 
ARP: SER 299:N,CA,C,O,CB,0G 
ARP : ALA 300 : N, CA, C, O, CB 
ARP : VAL 301:N,CA, C,0, CB,CG1, CG2 
ARP: SER 334:N / CA,C,0,CB,OG 
ARP: GLY 335:N,CA,C,0 
ARP:PRO 336:N,CA,CD,C,0,CB,CG 
Subset SUB5B: 

sub5bmole . list 
Subset SUB5B: 

ARP: 10-11, 34, 38, 65-68, 71-72, 120-121, 123-124, 
30 128-132,134,270,274, 

ARP :297-298,302-303 ,311-312, 332-333, 337-338 
sub5batom. list 
Subset SUB5B: 

ARP : VAL 10 : N, CA, C, O, CB, CGI , CG2 
ARP : THR ll:N,CA,C,0,CB,OGl,CG2 
ARP : GLN 34 : N, CA, C, O, CB , CG, CD, OE1 , NE2 
ARP : TYR 3 8 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
ARP: LEU 65:N,CA / C,0,CB,CG,CD1,CD2 
ARP : THR 66 :N, CA, C, O , CB, OG1 , CG2 
ARP: ALA 67:N,CA,C,0,CB 
ARP: ALA 68:N,CA,C,0,CB 

ARP : PHE 71:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
ARP : GLY 72:N,CA,C,0 

:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

:N,CA,C,0,CB 
:N,CA,C,0,CB 
:N, CA,C,0,CB,CG1,CG2 
:N,CA,C,0,CB,CG,0D1,ND2 
:N f CA,C,0,CB,SG 
:N,CA,CD,C,0,CB,CG 
:N,CA,C,0 
:N,CA,C,0,CB,OG 

:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
:N CA C O 

:n'ca',c!o,CB,CG,CD,NE,CZ,NH1,NH2 
:N f CA,C,0,CB,CG1,CG2,CD1 
:N,CA,CD,C,0,CB,CG 



35 



40 



45 



50 



55 



ARP: PHE 120: 

ARP: ALA 121: 

ARP: ALA 123: 

ARP: VAL 124: 

ARP : ASN 128; 

ARP:CYS 129: 

ARP: PRO 130: 

ARP: GLY 131: 

ARP: SER 132: 

ARP : ARG 134: 

ARP: GLY 270: 

ARP: ARG 274) 

ARP: ILE 297: 

ARP: PRO 298: 
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ARP:SER 302:N,CA,C,O,CB,OG 
ARP : ASN 3 03:N,CA, 0,0,03 ,CG,0D1,ND2 
ARP : GLY 311:N,CA,C,0 
ARP : GLY 312:N,CA,C,0 
5 ARP : THR 332:N,CA,C,0,CB,0G1,CG2 

ARP:ALA 333:N,CA,C,0,CB 
ARP : LEU 337:N,CA,C,0,CB,CG,CD1,CD2 
ARP:PRO 338:N,CA,CD,C,0,OB,CG 
Subset ACTSITE: 
10 actsitemole. list 
Subset ACTSITE: 

ARP: 44-61, 75-77, 79-80 ,87-88,90-96, 

99,118,122,126,135,148-149,152-158, 
ARP: 163-164, 167, 176-194, 197-205, 207-209, 211- 
15 213,216,230-231,241, 

ARP: 24 3-246, 249, 259, 273, 277, 280, 343-347H 
acts iteatom. list 



Subset ACTSITE 



20 



25 



30 



35 



40 



45 



50 



55 



ARP: 


GLU 


44: 


N , OA ,0,0 


,CB,CG,CD,0E1,0E2 






ARP: 


SER 


45: 


N, OA, 0,0 


,CB,OG 






ARP: 


PRO 


46: 


N,CA,CD,< 


~,0,CB,CG 






ARP: 


VAL 


47: 


N, OA, 0,0 


,CB,CG1,CG2 






ARP: 


ARG 


48: 


N, OA, 0,0 


,CB,CG,CD,NE,CZ,NH1 


,NH2 




ARP: 


LYS 


49: 


N, OA, 0,0 


,CB,CG,CD,CE,NZ 






ARP: 


ILE 


50: 


N, OA, 0,0 


,CB,CG1,CG2,CD1 






ARP: 


LEU 


51: 


N, OA, 0,0 


,CB,CG,CD1,CD2 






ARP: 


ARG 


52: 


N, OA, 0,0 


,CB,CG,CD,NE,CZ,NH1 


,NH2 




ARP: 


ILE 


53 : 


N,CA,C,0 


f CB,CG1,CG2,CD1 






ARP: 


VAL 


54: 


N,CA,0,O 


, OB , CGI , CG2 






ARP: 


PHE 


55: 


N, OA, 0,0 


,CB,CG,CD1,CD2,CE1, 


CE2, 


CZ 


ARP: 


HIS 


56: 


>N,CA,C,0 


,CB,CG,ND1,CD2,CE1, 


NE2 




ARP: 


ASP 


57: 


:N,CA,C,0 


,CB,CG,0D1,0D2 






ARP: 


ALA 


58: 


:N,CA,C,0 


,CB 






ARP: 


ILE 


59: 


:N,CA,C,0 


,CB,CG1,CG2,CD1 






ARP: 


GLY 


60; 


:N,CA,C,0 








ARP: 


tPHE 


61; 


:N,CA,C,0 


, CB , CG , CD1 , CD2 , CE1 , 


CE2, 


CZ 


ARP: 


► GLY 


75: 


:N,CA,C,0 








ARP: 


ALA 


76 


:N,CA,C,0 


,CB 






ARP; 


.ASP 


77< 


:N,CA,C f O 


,CB,CG,0D1,0D2 






ARP: 


:SER 


79 


:N,CA,C,0 


,CB,OG 






ARP: 


:ILE 


80 


:N,CA,C,0 


,CB,CG1,CG2,CD1 






ARP 


:GLU 


87 


:N,CA,C,0 


,CB,CG,CD,OEl,OE2 






ARP 


:LEU 


88 


:N,CA,C,0 


,CB,CG,CD1,CD2 






ARP 


:PHE 


90 


:N,CA,C,0 


,CB,CG,CD1,CD2,CE1, 


CE2, 


CZ 


ARP 


:PRO 


91 


:N,CA,CD, 


C , 0 , CB , CG 






ARP 


: ALA 


92 


:N,CA,C,0 


,CB 






ARP 


:ASN 


93 


:N,0A,C,O 


,CB,CG,0D1,ND2 






ARP 


:GLY 


94 


:N,CA,C,0 








ARP 


:GLY 


95 


:N,CA,C,0 








ARP 


:LEU 


96 


:N,CA,C,0 


,CB,CG,CD1,CD2 






ARP 


:THR 


99 


:N,CA,C,0 


,CB,0G1,CG2 






ARP 


:ILE 


118:N,CA,C, 


0,CB,CG1,CG2,CD1 






ARP 


:THR 


122:N,CA,C, 


0,CB,0G1,CG2 






ARP 


:MET 


126:N,CA,C, 


0,OB,CG,SD,CE 






ARP 


:LEU 


135:N,CA,C, 


0,CB,CG,CD1,CD2 






ARP 


:SER 


148:N,CA,C, 


0,CB,0G 






ARP 


:PRO 


149:N,CA,CD 


,C,0,CB,CG 
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ARP: 


LEU 


152: 


N, 


CA, 








ARP: 


ILE 


153: 


N, 


CA, 


c, 


o, 




ARP: 


PRO 


154: 


N, 


CA, 


CD,C 




ARP: 


GLY 


155: 


N, 


CA, 


c, 


0 


5 


ARP: 


PRO 


156: 


N, 


CA, 


CD,C 




ARP: 


GLY 


157: 


N, 


CA, 


c, 


0 




ARP: 


ASN 


158: 


N, 


CA, 


c, 


o, 




ARP: 


ILE 


163 : 


N, 


CA, 


c, 


o, 




ARP: 


LEU 


164: 


N, 


CA, 




0, 


10 


ARP: 


MET 


167: 


N, 


CA, 


c, 


0/ 




ARP: 


GLU 


176: 


N, 


CA, 




0, 




ARP: 


VAL 


177: 


N, 


CA, 


c, 


0/ 




ARP: 


VAL 


178: 


N, 


CA, 


c, 


o, 




ARP: 


ASP 


179: 


N, 


CA, 


c, 


o, 


15 


ARP: 


LEU 


180: 


N, 


CA, 


c / 


°r 




ARP: 


LEU 


181: 


N, 


CA, 


c, 


O r 




ARP: 


ALA 


182: 


N # 


CA, 


c. 


o, 




ARP: 


ALA 


183: 


N, 


CA, 


c, 


o, 




ARP: 


HIS 


184: 


N, 


CA, 


c, 


0, 


20 


ARP: 


SER 


185: 


N, 


CA, 


c, 


o, 




ARP: 


LEU 


186: 


N, 


CA, 


c, 


o, 




ARP: 


ALA 


187: 


N, 


CA, 


c, 


o, 




ARP: 


SER 


188: 


N, 


CA, 


c, 


o, 




ARP: 


GLN 


189: 


N, 


CA, 


c, 


o, 


25 


ARP: 


GLU 


190: 


N, 


CA, 


c, 


o, 




ARP: 


GLY 


191: 


N, 


CA, 


c, 


0 




ARP: 


:LEU 


192: 


N, 


CA, 


c, 


o, 




ARP: 


:ASN 


193: 


N, 


CA, 


c, 


o, 




ARP: 


:SER 


194: 


N, 


CA, 


c, 


o, 


30 


ARP 


>PHE 


197: 


N, 


CA, 


c, 


o, 




ARP 


: ARG 


198: 


Ni 


CA, 


c, 


o, 




ARP 


: SER 


199: 


,N, 


CA, 


c, 


o, 




ARP 


:PRO 


200: 


•N, 


CA, 


CD,C 




ARP 


:LEU 


201' 


;N, 


,CA, 




o, 


35 


ARP 


: ASP 


202 


:Nj 


, CA, 


c, 


o, 




ARP 


:SER 


203 


:N, 


, CA, 


c, 


o, 




ARP 


:THR 


204 


:N, 


rCA, 




0 r 




ARP 


:PRO 


205 


:N, 


rCA, 


CD,C 




ARP 


:VAL 


207 


:N, 


rCA, 


c, 


Or 


40 


ARP 


:PHE 


208 


:N 


pCA, 








ARP 


:ASP 


209 


:N 


rCA, 


r C J 


,0, 




ARP 


: GLN 


211 


:N 


rCA, 


fC t 


rO, 




ARP 


:PHE 


212 


:N 


,CA, 




r 0, 




ARP 


: TYR 


213 


:N 


r CA ( 




rO, 


45 


ARP 


:THR 


216 


:N 


pCA, 


f c 


pO, 




ARP 


:PHE 


230 


:N 


, CA, 


r c 


rO, 




ARP 


: ALA 


231 


:N 


,CA 


r c 


rO, 




ARP 


:PHE 


241 


:N 


,CA 


t c 


pO, 




ARP 


:MET 


243 


:N 


, CA 


pC 


pO, 


50 


ARP 


: ARG 


244 


:N 


f CA 


rC 


pO, 




ARP 


:SER 


245 


:N 


r CA 


f c 


pO, 




ARP 


:ASP 


246 


:N 


,CA 


,c 


pO, 




ARP 


:LEU 


249 


:N 


,CA 


f c 


rO, 




ARP 


:TRP 


259 


:N 


,CA 


r c 


rO, 


55 






CD2 , 


r NEl,CE2 




ARP : TYR 


273 


:N 


,CA 


rC 


,0, 




ARP: MET 


277:N 


,CA 


f c 


rO, 



,CD2 



,CG,SD,CE 

,CG,CD,0E1,C 

, CGI , CG2 

, CGI , CG2 

,CG,OD1,OD2 

,CG,CD1,CD2 

,CG,CD1,CD2 



,CD2 



CB 

CB,0G 



,0E1,NE2 
,0E1,0E2 



CB,OG 
CB,CG,CD3 
CB,CG,CD, 
CB,0G 
0,CB,CG 



CE1,CE2,CZ 
,NH1,NH2 



:,0,CB,CG 
CB,CG1,CG2 

CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
~ r CG,0Dl,0D2 
f CG,CD,0El,NE2 
f CG , CD1 , CD 2 , CE1 , CE2 , CZ 
f CG,CDl,CD2,CEl,CE2,CZ,OH 
f OGl,CG2 

, CG, CD1 , CD 2 , CE1 , CE2 , CZ 



CB, 
CB, 



,CD2,CE1,CE2,CZ 

CG , CD1 , CD2 , CE1 , CE2 , CZ 

CG,SD,CE 
nn pn 



CB,CG,SD,CE 
. CB,CG,CD,NE,CZ,NH1,NH2 
,0,CB,0G 

,0,CB,CG,0D1,0D2 
,0,CB,CG,CD1,CD2 
,0,CB,CG,CD1, 
:E2,CE3,CZ2,CZ3,CH2 
,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
,0,CB,CG,SD,CE 
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ARP : MET 280:N,CA,C,O,CB,CG,SD,CE 
ARP: ALA 343:N,CA,C, 0,CB 
ARP : PRO 344:N,CA,CD, 0,0,0X1,08,00 
ARP : HEM 345H: FE,NA,NB,NC,ND, CHA,CHB, 
5 CHC,CHD,C1A,C2A,C3A,C4A,CMA,CAA,CBA,CGA 
ARP : HEM 345H:01A, 02A, C1B, C2B, C3B, C4B, CMB, 

CAB,CBB,C1C,C2C,C3C,C4C,CMC / CAC / CBC 
ARP: HEM 3 4 5H : C1D , C2D , C3D , C4D , CMD , CAD , CBD , CGD , 01D , 02D 
ARP:CA 346H:CA 
10 ARP: OA 347H:CA 

Subset RESTx: 

restxmole. list 
Subset RESTX 

NEWMODEL: 9, 334-336 
15 restxatom. list 
Subset RESTX: 

NEWMODEL : SER 9 : N, OA, C , O, CB, OG 
NEWMODEL : SER 334 :N, OA, C,0, CB,OG 
NEWMODEL : GLY 335:N,CA,C,0 
20 NEWMODEL : PRO 336 : N, CA, CD, C, O, CB, CG 



Example 5 

Activation of mPEG 15,000 with N-succinimidvl carbonate 

25 mPEG 15,000 was suspended in toluene (4 ml/g of mPEG) 20% was 

distilled off at normal pressure to dry the reactants 
azeotropically. Dichloromethane (dry 1 ml/g mPEG) was added when 
the solution was cooled to 30°C and phosgene in toluene (1.93 M 5 
mole/mole mPEG) was added and mixture stirred at room temperature 

3 0 over night. The mixture was evaporated to dryness and the desired 
product was obtained as waxy lumps. 

After evaporation dichloromethane and toluene (1:2, dry 3 
ml/g mPEG) was added to re-dissolve the white solid. tf-Hydroxy 
succinimide (2 mole/mole mPEG.) was added as a solid and then 

35 triethylamine (1.1 mole/mole mPEG) . The mixture was stirred for 3 
hours, initially unclear, then clear and ending with a small 
precipitate. The mixture was evaporated to dryness and 
recrystallised from ethyl acetate (10 ml) with warm filtration to 
remove salts and insoluble traces. The blank liquid was left for 

40 slow cooling at ambient temperature for 16 hours and then in the 
refrigerator over night. The white precipitate was filtered and 
washed with a little cold ethyl acetate and dried to yield 98 % 
(w/w) . NMR Indicating 80 - 90% activation and 5 o/oo (w/w) 
HNEt 3 Cl. ^-NMR for mPEG 15,000 (CDC1 3 ) d 1.42 t (1= 4.8 CH 3 i 

45 HNEt 3 Cl) , 2.84 s (1= 3.7 succinimide), 3.10 dq (1= 3.4 CH 2 i 
HNEt 3 Cl), 3.38 S (1= 2.7 CH 3 i OMe) , 3.40* dd (I = 4.5 0/00, 13 C 



- WO 98/35026 



86 



PCT7DK98/00046 



satellite), 3,64 bs (I = 1364 main peak), 3.89* dd (I = 4.8 o/oo , 
13 C satellite), 4.47 dd (I = 1.8, CH 2 in PEG). No change was seen 
after storage in a desiccator at 22 °C for 4 months. 

5 Example 6 

Activation of mPEG 5.000 with N-succin imidvl carbonate 

Activation of mPEG 5,000 with N-succinimidyl carbonate was 
performed as described in Example 5. 

10 EXAMPLE 7 

Construction and expression of PD4 98 variants: 

PD498 site-directed variants were constructed using the "maxi- 

oligonucleotide-PCR 1 ' method described by Sarkar et al., (1990): 

BioTechniques 8: 404-407. 
15 The template plasmid was shuttle vector pPD498 or an analogue 

of this containing a variant of the PD498 protease gene. 

The following PD498 variants were constructed, expressed and 

purified. 

A: R28K 
20 B: R62K 

C: R169K 

D: R28K + R62K 

E: R28K + R169K 

F: R62K + R169K 
25 G: R28K+R69K+R169K 

Construction of variants 

For introduction of the R28K substitution a synthetic 

oligonucleotide having the sequence: GGG ATG TAA CCA AGG GAA GCA 
30 GCA CTC AAA CG (SEQ ID NO. 7) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 

prepared by Bst E II and Bgl II digestion. Positive variants were 

recognized by Styl digestion and verified by DNA sequencing of the 

total 769 bp insert. 
35 For introduction of the R62K substitution a synthetic 

oligonucleotide having the sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
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prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by Clal digestion and verified by DNA sequencing of the 
total 769 bp insert. 

For introduction of the R169K substitution a synthetic 
5 oligonucleotide having the sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) was used, 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by the absence of a Rsa I restriction site and verified 
10 by DNA sequencing of the total 769 bp insert. 

For simultaneously introduction of the R28K and the R62K 
substitutions, synthetic oligonucleotides having the sequence: 
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the 
sequence : 

15 CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and verified 
by DNA sequencing of the total 769 bp insert. 

20 For simultaneously introduction of the R28K and the R169K 
substitutions , synthetic oligonucleotides having the sequence: GGG 
ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 8) and the 
sequence : 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 8) were used 

2 5 simultaneously. A PCR fragment of 769 bp was ligated into the 

pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert. 

30 For simultaneously introduction of the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: CGA 
CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the sequence: 
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 

3 5 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 

variants were recognized by Clal digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert 
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For simultaneously introduction of the R28K, the R62K and the 
R169K substitutions, synthetic oligonucleotides having the 
sequence : 

GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID No. 7), the 
5 sequence : 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the 
sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
10 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and absence of 
a Rsa I site. The variant was verified by DNA sequencing of the 
total 769 bp insert. 

15 Fermentation, expression and purification of PD498 variants 

Vectors hosting the above mentioned PD498 variants were 
purified from E. coli cultures and transformed into B. subtilis in 
which organism the variants were fermented, expressed and purified 
as described in the "Materials and Methods" section above. 

20 

Example 7 

Conjugation of triple substitited PD498 variant with activated 
mPEG 5.000 

200 mg of triple substituted PD498 variant (i.e. the 
25 R28K+R62K+R169K substituted variant) was incubated in 50 mm 
NaBorate, pH 10, with 1.8 g of activated mPEG 5,000 with N- 
succinimidyl carbonate (prepared according to Example 2), in a 
final volume of 20 ml. The reaction was carried out at ambient 
temperature using magnetic stirring. Reaction time was 1 hour. The 
30 reaction was stopped by adding DMG buffer to a final concentration 
of 5 mM dimethyl glutarate, 1 mM CaCl2 and 50 mM borate, pH 5.0. 

The molecule weight of the obtained derivative was approxi- 
mately 120 kDa, corresponding to about 16 moles of mPEG attached 
per mole enzyme. 

35 Compared to the parent enzyme, residual activity was close to 

100% towards peptide substrate (succinyl-Ala-Ala-Pro-Phe-p- 
Nitroanilide) . 
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Example 8 

Alleraenicitv trails of PD498 variant-SPEG5 ( 000 in cruin ea pigs 

Dunkin Hartley guinea pigs are stimulated with 1.0 |ig PD498- 
SPEG 5,000 and l.o ^g modified variant PD498-SPEG 5,000 by 
5 intratracheal installation. 

Sera from immunized Dunkin Hartley guinea pigs are tested 
during the trail period in a specific IgG a ELISA (described above) 
to elucidate whether the molecules could activate the immune 
response system giving rise to a specific IgGx response indicating 
10 an allergenic response. 

The IgGx levels of Dunkin Hartley guinea pigs during the trail 
period of 10 weeks are observed. 

Example 9 

15 Suitable substitutions in Humlcola lanuginosa lipase for 

addition of amino attachment groups (-NHo l 

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO 6) 

is available in Brookhaven Databank as Itib.pdb. The lipase 

consists of 269 amino acids. 
20 The procedure described in Example 1 was followed. The 

sequence of if. lanuginosa lipase is shown below in the table 

listing solvent accessibility data for H. lanuginosa lipase. 

H. lanuginosa residue numbering is used (1-269) , and the active 

site residues (functional site) are S146 # S201 and H258. The 
25 synonym TIB is used for H . lanuginosa lipase. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeKzone.bcl and makeKzone2 . bcl below: 

Conservative substitutions: 

3 0 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 255,0,255 

3 Zone Subset LYS :lys:NZ Static monomer /residue 10 
Color_Subset 255,255,0 

35 4 Zone Subset NTERM :1:N Static monomer/residue 10 
ColorjSubset 255,255,0 

5 #NOTE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 146, 201, 258 Static monomer/residue 8 
40 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #N0TE: editnextline object name according to the protein 
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10 Combine Subset REST Difference TIB ALLZONE 

11 List Subset REST Atom Output File restatom. list 

12 List Subset REST monomer /residue Output JFile restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 14 List Subset ACTSITE Atom Output^File actsiteatom. list 

15 List Subset ACTSITE monomer/ residue OutputJFile 
actsitemole . list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
10 Color_Subset 

18 Combine Subset SUB5A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB 5 A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

15 21 List Subset SUB5B Atom Output File sub5 bat om. list 

22 List Subset SUB5B monomer /residue OutputJFile subSbmole. list 

23 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list , 
20 subSbatom. list & accsiteatom. list 

Comments : 

In this case of H . lanuginosa (=TIB) , REST contains the 
Arginines Argl33, Argl39, Argl60, Argl79 and Arg 209, and SUB5B 
25 contains Argll8 and R125. 

These residues are all solvent exposed. The substitutions 
R133K, R139K, R160K, R179K, R209K, R118K and R125K are 
identified in TIB as sites for mutagenesis within the scope of 
this invention. The residues are substituted below in section 
30 2, and further analysis done. The subset ACTSITE contains no 
lysines. 



Non-conservative substitutions: 
makeKzone2 • bcl 

35 1 #sourcefile makezone2 .bcl Claus von der Osten 961128 

2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 
40 5 Copy Object -To_Clipboard -Displace TIB newmodel 

6 Biopolymer 

7 #N0TE: editnextline object name according to protein 

8 Blank Object On TIB 

9 #NOTE: editnextlines with lys->arg positions 
45 10 Replace Residue newmodel: 118 lys L 

11 Replace Residue newmodel: 125 lys L 

12 Replace Residue newmodel: 133 lys L 

13 Replace Residue newmodel: 139 lys L 

14 Replace Residue newmodel: 160 lys L 
50 15 Replace Residue newmodel: 179 lys L 

16 Replace Residue newmodel: 209 lys L 
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17 * . , 

18 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

19 Color Molecule Atoms newmodel Specified Specification 
5 255,0,255 

2 0 Zone Subset LYSx newmodel : lys:NZ Static monomer /residue 10 
Color_Subset 255,255,0 

21 Zone Subset NTERMx newmodel: 1:N Static monomer /residue 10 
Color_Subset 255,255,0 
10 22 #N0TE: editnextline ACTSITEx residues according to the 
protein 

23 Zone Subset ACTSITEx newmodel: 146, 201, 258 Static 
monomer/ residue 8 Color_Subset 255,255,0 

24 Combine Subset ALLZONEx Union LYSx NTERMx 

15 25 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

26 Combine Subset RESTx Difference newmodel ALLZONEx 

27 List Subset RESTx Atom Output_File restxatom. list 

28 List Subset RESTx monomer /residue Output_File 
restxmole. list 

20 29 # 

30 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

31 List Subset ACTSITEx Atom Output File actsitexatom. list 

32 List Subset ACTSITEx monomer/ residue Output_File 
25 actsitexmole . list 

33 # . 

34 #read restxatom. list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

3 0 Comments: 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 18,31-33,36,38,40,48,50,56-62,64,78,88,91-93,104- 
106,120,136,225,227-229,250,262,268. Of these three are 
Cysteines involved in disulfide bridge formation, and 
35 consequently for structural reasons excluded from the residues 
to be mutated. The following mutations are proposed in H. 
lanuginosa lipase (TIB) : 

A18K,G31K,T32K / N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
40 V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 
Relevant data for Example 2: 

# TIBNOH20 

# residue area 
GLU_1 110.792610 

45 VAL_2 18.002457 

SER_3 53.019516 

GLN_4 85.770164 

ASP_5 107.565826 

LEU_6 33.022659 

50 PHE_7 34.392754 

ASN 8 84.855331 
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GLN_9 
PHE_10 
ASN_11 
LEUJL2 
5 PHE_13 
ALA_14 
GLN_15 
TYR_16 
SER_17 

10 ALA_18 
ALA_19 
ALA_2 0 
TYR_21 
CYS_22 

15 GLY_2 3 
LYS_24 
ASN_25 
ASN_26 
ASP_27 

20 ALA_28 
PRO_29 
ALA_30 
GLY_31 
THRJ52 

25 ASN_33 
ILE_34 
THR_35 
CYS_3 6 
THR_37 

30 GLY_38 
ASN_39 
ALA_40 
CYS_41 
PRO_42 

35 GLU_43 
VAL_44 
GLU_45 
LYS_46 
ALA_47 

40 ASP_48 
ALA_49 
THR_50 
PHE_51 
LEU_52 

45 TYR_53 
SER_54 
PHE_55 
GLU_56 
ASP_57 

50 SER_58 
GLY_59 
VAL_60 
GLY_61 
ASP_62 

55 VAL_63 
THR_64 
GLY 65 



39.175591 

2.149547 

40.544380 

27.648788 

2.418241 

4.625293 

28.202387 

0.969180 

0.000000 

7.008336 

0.000000 

0.000000 

6.947358 

8.060802 

32.147034 

168.890747 

8.014721 

II. 815564 
92.263428 
18.206699 
83.188431 
69.428421 
50.693439 
52.171135 

III. 230743 
2.801945 
82.130569 
17.269245 
96.731941 
77.870995 
123.051003 
27.985256 
0.752820 
46.258949 
69.773987 
0.735684 
77. 169510 
141.213562 
10.249716 
109.913902 
2.602721 
32.012184 
8.255627 
60.093613 
77.877937 
26.980494 
10.747735 
112.689758 
92.064278 
32.990780 
53.371807 
83.563644 
69.625633 
75.520988 
4.030401 
8.652839 
0.000000 
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PHE 


66 


0.268693 




LEU 


'67 


11.822510 




ALA 


'68 


0.537387 




LEU 


'69 


30.243870 


5 


ASP 


'70 


0.000000 




ASN 


'71 


84.101044 




THR 


72 


89.271126 




ASN 


'73 


70.742401 




LYS~ 


"74 


98.319168 


10 


LEU 


'75 


8.329495 




ILE 


"76 


5.197878 




VAL 


77 


0.806080 




LEU_ 


'78 


5.293978 




SER 


'79 


0.000000 


15 


PHE" 


80 


2.079151 




ARG 


81 


41.085312 




GLY_ 


82 


1.471369 




SER~ 


'83 


43.794014 




ARG 


'84 


100.261627 


20 


SER" 


85 


70.607552 




ILE" 


"86 


59.696865 




GLU~ 


"87 


136.510773 




ASN" 


"88 


119.376373 




TRP 


"89 


102.851227 


25 


ILE" 


"90 


78.068588 




GLY 


'91 


60.783607 




ASN 


"92 


45.769428 




LEU" 


"93 


134.228363 




ASN 


"94 


101.810959 


30 


PHE 


"95 


41.212212 




ASP" 


"96 


79.645950 




leu" 


"97 


25.281572 




LYS" 


"98 


88.840263 




GLU" 


'99 


132.377090 


35 


ILE 


"100 


9.135575 




ASN" 


"101 


63.444527 




asp" 


"102 


88.652847 




ile" 


"103 


33.470661 




CYS" 


"104 


11.553816 


40 


SER" 


"105 


99.461174 




GLY" 


"106 


40.325161 




CYS" 


"107 


4.433561 




ARG" 


"108 


97.450104 




GLY" 


"109 


1.343467 


45 


his" 


"no 


4.652464 




ASP" 


111 


37.023655 




GLY" 


"112 


29.930408 




PHE" 


"113 


14.976435 




THR" 


"114 


10.430954 


50 


ser" 


"115 


40.606895 




ser] 


"116 


13.462922 




TRP" 


"117 


10.747735 




ARG* 


118 


114.364281 




SER" 


"119 


46.880249 


55 


VAL 


"120 


13.434669 




ALA" 


"121 


18.258261 




ASP" 


"122 


110.753098 
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THR 


123 


69.641922 




LEU 


124 


17.090784 




ARG 


125 


73.929977 




GLN 


126 


101.320190 


5 


LYS 


127 


84.450241 




VAL 


128 


6.448641 




GLU 


129 


47.700993 




ASP 


130 


75.529091 




ALA 


131 


11.340775 


10 


VAL 


132 


27.896025 




ARG~ 


133 


153.136490 




GLU 


134 


132.140594 




HIS 


135 


54.553406 




PRO 


136 


97.386963 


15 


ASP 


137 


22.653191 




TYR 


138 


35.392658 




ARG 


139 


74.321243 




VAL 


'140 


10.173222 




VAL 


141 


0.233495 


20 


PHE 


'142 


3.224321 




THR 


'143 


0. 000000 




GLY 


144 


0.000000 




HIS" 


145 


4.514527 




SER 


146 


15.749787 


25 


LEU"" 


'147 


40.709171 




GLY" 


"148 


0.000000 




GLY*" 


"149 


0.000000 




ALA" 


"150 


0.537387 




LEU" 


"151 


22.838938 


30 


ALA" 


"152 


0.268693 




THR" 


*153 


18.078798 




VAL" 


"154 


7.254722 




ALA" 


"155 


0.000000 




GLY" 


"156 


0.000000 


35 


ALA" 


"157 


15.140230 




ASP" 


"158 


41.645477 




leu" 


"159 


6.144750 




arg" 


"160 


41.939716 




GLY" 


"161 


68.978180 


40 


ASN" 


"162 


68.243805 




GLY' 


"163 


79.181274 




TYR" 


"164 


36.190247 




ASP" 


"165 


103.068283 




ILE" 


"166 


0.000000 


45 


asp" 


"167 


24.326443 




VAL" 


"168 


4.299094 




PHE" 


"169 


0.466991 




ser" 


"170 


3.339332 




tyr" 


"171 


0.000000 


50 


GLY" 


"172 


0.000000 




ALA" 


"173 


12.674671 




PRO" 


"174 


13.117888 




arg" 


"175 


10.004488 




VAL" 


176 


21.422220 


55 


GLY" 


"177 


2.680759 




ASN 


"178 


21.018063 




ARG" 


179 


110.282166 
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ALAJL80 
PHE_181 
ALAJL82 
GLUJL83 
5 PHE_184 
LEU_185 
THRJL86 
VAL_187 
GLN_188 

10 THR_189 
GLY_190 
GLY_191 
THR_192 
LEUJL93 

15 TYR_194 
ARGJL95 
ILE_196 
THR_197 
HIS_198 

20 THR_199 
ASN_200 
ASP_201 
ILEJ202 
VAL_203 

25 PRO_204 
ARG_2 05 
LEU_2 06 
PRO_2 07 
PRO_208 

3 0 ARG_209 
GLU_210 
PHE_211 
GLY_212 
TYR_213 

3 5 SER_214 
HIS_215 
SER_216 
SER_217 
PRO_218 

40 GLU_219 
TYR_220 
TRP_221 
ILE_222 
LYS_223 

45 SER_224 
GLY_225 
THR_226 
LEU_227 
VAL_228 

50 PRO_229 
VAL_230 
THR_231 
ARG_232 
ASN_233 

55 ASP_234 
ILE_235 
VAL 23 6 



33.210381 

4.567788 

3.897251 

76.354004 

71.225983 

24.985012 

47.023815 

98.244606 

54.152954 

88.660645 

24.792120 

10.726818 

45.458744 

16.633211 

34.829491 

29.030851 

1.973557 

3.493014 

1.532270 

34.785877 

39.789238 

0.000000 

31.168434 

29.521076 

3.515322 

44.882454 

51.051746 

12.575329 

43.259636 

113.700233 

154.628540 

112.505188 

30.084938 

3.268936 

12.471436 

23.354481 

16.406200 

14.665598 

17.240993 

13.145291 

18.718306 

39.229233 

5.105175 

120.739983 

15.407301 

29.306646 

66.806862 

122.682808 

60.923004 

104.620377 

23.398251 

63.372971 

80.357857 

89.255066 

43.011250 

2.114349 

45.140491 
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LYS 


237 


105.651306 




ILE 


238 


24.671705 




GLU 


239 


116.891907 




GLY 


"24 0 


31.965794 


5 


ILE 


"241 


46.278099 




ASP~ 


242 


28.963699 




ALA~ 


"243 


25.158146 




THR" 


'244 


98.351440 




GLY*" 


"245 


43.842186 


10 


GLY 


'246 


0.700486 




ASN 


"247 


3.926274 




ASN 


"248 


51.047890 




GLN" 


249 


66.699188 




PRO 250 


132.414047 


15 


ASN 251 


70.213730 




ILE 


252 


141.498062 




pro" 


"253 


59.089233 




asp" 


"254 


59.010895 




ILE" 


"255 


63.298943 


20 


pro" 


"256 


78.608688 




ALA 


"257 


0.806080 




HIS 


'258 


3.761708 




LEU" 


'259 


50.747856 




trp" 


"260 


35.229710 


25 


tyr" 


"261 


5.440791 




phe" 


262 


36.457939 




GLY~ 


'263 


22.071375 




LEU 


264 


109.148178 




ILE" 


"265 


2.418241 


30 


GLY" 


"266 


17.730062 




THR" 


"267 


68.217873 




CYS" 


"268 


15.418195 




LEU" 


"269 


165.990997 



Subset REST: 
35 restmole. list 
Subset REST: 

TIB: 5, 8-9, 13-14,16,18-20,31-34, 36, 38,40,48-50, 56- 
66,68,76-79,88,91-93, 

TIB: 100-107 , 116-117 , 119-121, 132-134 , 136 , 139-142 , 154 
40 169,177-185, 

TIB : 187, 189-191, 207-212, 214-216, 225, 227-229, 241- 

244,250,262,268 
restatom.list 
Subset REST: 
45 TIB:ASP 5:N,CA,C,0,CB,CG,0D1,0D2 

TIB: ASN 8:N,CA,C,0,CB,CG,0D1,ND2 

TIB : GLN 9 : N , CA , C , O , CB , CG , CD , OE1 , NE2 

TIB : PHE 13 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 

TIB: ALA 14:N,CA,C,0,CB 
5 0 TIB : TYR 16 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

TIB: ALA 18 :N, CA, C, O, CB 

TIB: ALA 19 :N, CA, C, O, CB 

TIB: ALA 20:N,CA,C,O,CB 

TIB: GLY 31:N,CA,C,0 
55 TIB: THR 32:N,CA,C,0,CB,0G1,CG2 

TIB: ASN 33:N,CA,C,0,CB,CG,0D1,ND2 

TIB: ILE 34:N,CA,C,0,CB,CG1,CG2,CD1 
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TIB: 


CYS 


36: 


N,CA,C,0 




TIB: 


GLY 


38: 


N,CA,C,0 




TIB: 


ALA 


40: 


N , C A , C , 0 




TIB: 


ASP 


48: 


N,CA,C,0 


5 


TIB: 


ALA 


49: 


N,CA,C,0 




TIB: 


THR 


50: 


N, CA,C f 0 




TIB: 


GLU 


56: 


N,CA,C,0 




TIB: 


ASP 


57: 


:N,CA,C,0 




TIB: 


SER 


58: 


,N,CA,C,0 


10 


TIB: 


GLY 


59: 


;N,CA,C,0 




TIB: 


VAL 


60: 


;N,CA,C,0 




TIB: 


GLY 


61: 


N,CA,C,0 




TIB: 


ASP 


62: 


:N,CA,C,0 




TIB: 


VAL 


63: 


:N,CA,C,0 


15 


TIB: 


:THR 


64: 


:N,CA,C,0 




TIB: 


;GLY 


65' 


;N,CA,C,0 




TIB: 


:PHE 


66 


:N,CA,C,0 




TIB 


t ALA 


68 


: N , C A , C , 0 




TIB 


:ILE 


76 


:N,CA, C,0 


20 


TIB 


:VAL 


77 


:N,CA,C,0 




TIB 


:LEU 


78, 


:N,CA,C,0 




TIB 


:SER 


79 


:N,CA,C,0 




TIB 


:ASN 


88 


:N,CA,C,0 




TIB 


:GLY 


91 


:N,CA,C,0 


25 


TIB 


:ASN 


92 


:N,CA,C,0 




TIB 


:LEU 


93 


:N,CA,C,0 




TIB 


:ILE 


100:N,CA,C, 




TIB 


:ASN 


101:N,CA,C, 




TIB 


:ASP 


102:N,CA, C, 


30 


TIB 


:ILE 


103:N,CA, C, 




TIB 


:CYS 


104:N,CA,C, 




TIB 


:SER 


105:N,CA,C, 




TIB 


:GLY 


106:N,CA,C, 




TIB 


:CYS 


107:N,CA, C, 


35 


TIB 


:SER 


116:N,CA,C, 




TIB 


:TRP 


117:N,CA,C, 



40 



45 



50 



55 



, u 

,0,0b 

,0,06,00,001,002 
,0,0b 

,0,CB,0G1,CG2 
,0,CB,CG,CD,0E1,0E2 
P 0,CB,CG,0D1,0D2 
,0,CB,0G 



,CB,CG,0D1,0D2 
,CB,CG1,CG2 
, CB,0G1, CG2 

, CB , CG , GDI , CD2 , CE1 , CE2 , CZ 
CB 

,CB,CG1,CG2,CD1 
,CB,CG1,CG2 
,CB,CG,CD1,CD2 
CB,OG 

CB,CG,0D1,ND2 



CE3 , CZ2 
TIB: SER 
TIB: VAL 
TIB: ALA 
TIB: VAL 
TIB: ARG 
TIB: GLU 
TIB: PRO 
TIB: ARG 
TIB: VAL 
TIB: VAL 
TIB:PHE 
TIB: VAL 
TIB: ALA 
TIB: GLY 
TIB: ALA 
TIB: ASP 
TIB: LEU 
TIB: ARG 
TIB: GLY 
TIB : ASN 



,CZ3,CH2 
119:N,CA,C 

1 20iN_CA,C 



121:N 
132:N 
133:N 
134:N 



139 



155 



N 



OA, C^O^CB 



„„N,CA,C,0 
156:N,CA,C 
157:N,CA,C,0,CB 
158:N 



CA,C,0,CB, 



,0D2 



CB,OG 



CB,OG 

CB / CG,CD1,CD2,NE1,CE2, 

CB,OG 
CB,CG1,CG2 
PR 



CB 

CA,C,0,CB,CG1,CG2 
CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
„ . ^,CA,C,0,CB,CG,CD,OEl,OE2 
136:N,CA,CD,C,0,CB,CG 

- - - - r»ia nn nr\ mp r»«7 MH1 MH5 



>,C,0,CB,CG 
CA,C,0,OB,CG,CD, 
0,CB,CG1,CG 



: , O , CB , CG 

CB,CG,CD,NE,CZ,NH1,NH2 
:N,CA,C,0,CB,CG1,CG2 
141:N,CA,C,0,CB,CG1,CG2 
142:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
.u ™ n r\ , CB,CG1,CG2 
-CB 



,CG,0D1,0D2 
W ,0,CB,CG,CD1,CD2 
160:N,CA,C,0,CB,CG,CD,NE,CZ,NH1, 
161:N,CA,C,0 

162:N,CA,C,0,CB,CG,0D1,ND2 



NH2 
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TIB: 


GLY 


163: 


N, 


CA, 


C,0 






TIB: 


TYR 


164: 


N, 


CA, 


C,O f CB,CG,CDl,CD2,CEl, 


CE2,CZ,OH 




TIB: 


ASP 


165: 


N, 


CA, 


C,0,CB,CG,ODl,OD2 






TIB: 


ILE 


166: 


N, 


CA, 


C,0,CB,CG1,CG2,CD1 




5 


TIB: 


ASP 


167: 


N, 


CA, 


0,0,06,00,001,002 






TIB: 


VAL 


168: 


N, 


CA, 


C,0,CB,CG1,CG2 






TIB: 


PHE 


169: 


N, 


CA, 


C,0,CB,CG,CD1 / CD2,CE1, 


CE2 , CZ 




TIB: 


GLY 


177: 


N, 


CA, 


c,o 






TIB: 


ASN 


178: 


N, 


CA, 


C,0,CB,CG,0D1,ND2 




10 


TIB: 


ARG 


179: 


N, 


CA, 


C,0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


ALA 


180: 


N, 


CA, 


C,0,CB 






TIB: 


PHE 


181: 


N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2 , CZ 




TIB: 


ALA 


182: 


N, 


CA, 


C,0,CB 






TIB: 


GLU 


183: 


N, 


CA, 


0,0, CB , CG , CD , 0E1 , 0E2 




15 


TIB: 


PHE 


184: 


N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 




TIB: 


LEU 


185: 


N, 


CA, 


C,0,CB,CG,CD1,CD2 






TIB: 


VAL 


187: 


N, 


CA, 


C,0,CB,CG1,CG2 






TIB: 


THR 


189: 


N, 


CA, 


C,0,CB,OG1,CG2 






TIB: 


GLY 


190: 


N, 


CA, 


C,0 




20 


TIB: 


GLY 


191: 


• N, 


CA, 


C,0 






TIB: 


PRO 


207: 


:N, 


CA, 


CD,C,0,CB,CG 






TIB: 


PRO 


208: 


:N, 


,CA, 


CD,C,0,CB,CG 






TIB: 


ARG 


209: 


:N, 


CA, 


CjOjCBfCGjCDjNE^Z^Hl 


,NH2 




TIB: 


.GLU 


210: 


:N, 


,CA, 


0,0, CB , CG , CD , 0E1 , 0E2 




25 


TIB: 


:PHE 


211: 


:N, 


,CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 




TIB: 


:GLY 


212: 


:N ( 


,CA, 


C,0 






TIB. 


SER 


214 


:N, 


,CA, 


C , 0 , CB , OG 






TIB- 


:HIS 


215 


:N, 


,CA, 


C,0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB 


:SER 


216 


:N, 


,CA, 


C,O f CB,OG 




30 


TIB 


:GLY 


225 


:N, 


,CA, 


,C,0 






TIB 


:LEU 


227 


:N, 


rCA ( 


,0,0,06,00,001,002 






TIB 


:VAL 


228 


:N, 


-CA, 


,0,0,08,001,002 






TIB 


:PRO 


229 


:N 


r CA 


,CD,C,0,CB,CG 






TIB 


:ILE 


241 


:N 


r CA 


,0,0,08,001,062,001 




35 


TIB 


:ASP 


242 


:N 


,CA 


,C,0,CB,CG,0D1,0D2 






TIB 


: ALA 


243 


:N 


,CA 


r C,0,CB 






TIB 


:THR 


244 


:N 


,CA 


,C,0,CB,0G1,CG2 






TIB 


:PRO 


250 


:N 


,CA 


, CD , C , 0 , CB , CG 






TIB 


:PHE 


2 62 


:N 


,CA 


,C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 


40 


TIB 


:CYS 


268 


:N 


,CA 


r C,0,CB,SG 





Subset SUB5B: 

sub5mole. list 
Subset SUB5B: 

TIB: 3-4, 6-7, 10-12, 15, 22-23 ,25-30, 35, 37, 39, 41-42, 44-47, 51- 

45 55,67,69-70, 

TIB: 72, 74-75, 94-99, 108-112, 114-115, 118, 122-126, 128- 

131,135,137-138, 

TIB: 186, 188, 192-195, 213, 217-219 ,223-224, 230-231, 234-235, 238- 
240, 

50 TIB:245,269 

sub5batom. list 
Subset SUB5B: 

TIB: SER 3 :N, CA, C, O, CB, OG 
TIB:GLN 4:N,CA,C,0,CB,CG,CD,0E1,NE2 
55 TIB: LEU 6 : N, CA, C, O, CB, CG, CD1, CD2 

TIB : PHE 7 : N , CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 
TIB: PHE 10:N,CA,C / 0,CB,CG,CD1,CD2,CE1,CE2,CZ 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



TIB: 


ASN 


11: 


TIB: 


LEU 


12: 


TIB: 


GLN 


15: 


TIB: 


CYS 


22: 


TIB: 


GLY 


23: 


TIB: 


ASN 


25: 


TIB: 


ASN 


26: 


TIB: 


ASP 


27: 


TIB: 


ALA 


28: 


TIB: 


PRO 


29: 


TIB: 


ALA 


30: 


TIB: 


THR 


35: 


TIB: 


THR 


37: 


TIB: 


ASN 


39: 


TIB: 


CYS 


41: 


TIB: 


PRO 


42: 


TIB: 


VAL 


44: 


TIB: 


GLU 


45: 


TIB: 


LYS 


46: 


TIB: 


ALA 


47: 


TIB: 


PHE 


51: 


TIB: 


LEU 


52: 


TIB: 


TYR 


53: 


TIB: 


SER 


54: 


TIB: 


PHE 


55: 


TIB: 


LEU 


67: 


TIB: 


LEU 


69: 


TIB: 


ASP 


70: 


TIB: 


THR 


72: 


TIB: 


LYS 


74: 


TIB: 


LEU 


75: 


TIB: 


ASN 


94: 


TIB: 


PHE 


95: 


TIB: 


>ASP 


96: 


TIB: 


>LEU 


97: 


TIB: 


:LYS 


98: 


TIB: 


GLU 


99: 


TIB: 


ARG 


108 


TIB: 


GLY 


109 


TIB: 


HIS 


110 


TIB: 


ASP 


111 


TIB: 


.GLY 


112 


TIB 1 


:THR 


114 


TIB 


;SER 


115 


TIB 


; ARG 


118 


TIB 


:ASP 


122 


TIB 


:THR 


123 


TIB 


:LEU 


124 


TIB 


: ARG 


125 


TIB 


:GLN 


126 


TIB 


:VAL 


128 


TIB 


:GLU 


129 


TIB 


:ASP 


130 


TIB 


: ALA 


131 


TIB 


:HIS 


135 


TIB 


:ASP 


137 


TIB 


:TYR 


138 



N, 


CA, 


c, 


0,C6,CG,0D1,ND2 






N, 


CA, 


c, 


0,06,00,001,002 






N, 


CA, 


c, 


0,C8,CG,CD,0E1,NE2 






N, 


CA, 


c, 


0,CB,SG 






N, 


CA, 


c, 


0 






N, 


CA, 


c, 


0,CB,CG,0D1,ND2 






N, 


CA, 


c, 


0,CB,CG,0D1,ND2 






N, 


CA, 


c, 


0,CB,CG,0D1,0D2 






N, 


CA, 


C, 


0,CB 






N, 


CA, 


CD,C,0,C8,CG 






N, 


CA, 


c, 


0,CB 






N, 


CA, 


C| 


0,CB,0G1,CG2 






N, 


CA, 


c, 


0,CB,0G1,CG2 






N, 


CA, 


c, 


0,CB,CG,0D1,ND2 






N, 


CA, 


c, 


0,CB, SG 






N, 


CA, 


CD,C, 0,CB,CG 






N, 


CA, 


c, 


0,CB,CG1,CG2 






N, 


CA, 


c, 


0, CB,CG, CD,0E1,0E2 






N, 


CA, 


c, 


0,CB, CG,CD,CE,NZ 






N, 


CA, 


c, 


0,CB 






N, 


CA, 


c, 


0 , CB , CG , CD1 , CD2 , CE1 , 


CE2 


,CZ 


N , 


CA, 


c, 


0,CB, CG,CD1,CD2 






N, 


CA, 


c, 


0 , CB , CG , CD1 , CD2 , CE1 , 


CE2 


,CZ,OH 


N, 


CA, 


c, 


0,CB,OG 






N, 


CA, 




0,CB,CG,CD1,CD2,CE1, 


CE2 


,CZ 


N, 


CA, 


c, 


0,CB,CG,CD1,CD2 






N, 


CA, 


c, 


O r CB,CG,CDl,CD2 






N, 


CA, 


c# 


O r CB,CG,0Dl,OD2 






N, 


CA, 


c, 


0,CB,0G1,CG2 






N| 


CA, 


c, 


0,CB f CG,CD,CE,NZ 






N, 


CA, 


c, 


0,CB,CG,CD1,CD2 






N, 


CA, 


c, 


0,CB,CG, 0D1,ND2 






N, 


CA, 


c, 


0,06,00,001, CD2 , CE1 , 


CE2 


,CZ 


N ( 


CA, 


c, 


0,CB,CG,0D1,0D2 






N, 


CA, 


c, 


0,C6,CG,CD1,CD2 






N, 


CA, 


c, 


0,C8,CG,CD,CE,NZ 






N, 


,CA, 


c, 


0,C8,CG,CD,OE1,OE2 
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TIB: THR 186 
TIB:GLN 188 
TIB: THR 192 
TIB: LEU 193 
5 TIB: TYR 194 

TIB: ARG 195 
TIB: TYR 213 
TIB: SER 217 
TIB: PRO 218 
10 TIB:GLU 219 

TIB:LYS 223 
TIB: SER 224 
TIB : VAL 230 
TIB: THR 231 
15 TIB:ASP 234 

TIB: ILE 235 
TIB: ILE 238 
TIB: GLU 239 
TIB:GLY 240 
20 TIB:GLY 245 

TIB: LEU 269 
Subset ACTSITE: 

actsitemole . list 
Subset ACTSITE: 

25 TIB: 17,21, 80-87,89-90, 113 ,143-153 ,170-176 ,196-206 ,221- 

222,226,246-249, 
TIB: 251-261, 263-267 
actsiteatom. list 
Subset ACTSITE: 



30 


TIB: 


SER 


17:N,CA,C,0,CB,OG 








TIB: 


TYR 


2 1 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , 


CZ, 


OH 




TIB: 


PHE 


80:N,CA,C,O,CB,CG,CDl,CD2,CEl,CE2, 


CZ 






TIB: 


ARG 


81:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 








TIB: 


.GLY 


82:N,CA,C,0 






35 


TIB: 


:SER 


83:N,CA,C,0,CB,OG 








TIB: 


: ARG 


84:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 








TIB: 


.SER 


85:N,CA,C,0,CB,OG 








TIB: 


:ILE 


86:N,CA,C,0,CB,CG1,CG2,CD1 








TIB: 


:GLU 


87:N,CA,C,O,CB,CG,CD,0El,0E2 






40 


TIB: 


:TRP 


89:N,CA,C,0,CB,CG,CD1,CD2 / NE1,CE2, 


CE3 


,CZ2,CZ3,CH2 




TIB: 


:ILE 


90:N,CA,C,0,CB,CG1,CG2,CD1 








TIB: 


:PHE 


113:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2 


,CZ 






TIB: 


:THR 


143:N,CA,C,0,CB,OG1,CG2 








TIB- 


:GLY 


144:N,CA,C,0 






45 


TIB: 


:HIS 


145:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 








TIB: 


:SER 


146:N,CA,C,0,CB,OG 








TIB: 


:LEU 


147:N,CA,C,0,CB,CG,CD1,CD2 








TIB- 


:GLY 


148:N,CA,C,0 








TIB: 


: GLY 


149:N,CA,C,0 






50 


TIB- 


: ALA 


150:N,CA,C,O,CB 








TIB 


:LEU 


151:N,CA,C,0,CB,CG,CD1,CD2 








TIB 


: ALA 


152:N,CA,C,0,CB 








TIB 


:THR 


153:N,CA,C,0,CB,OG1,CG2 








TIB 


:SER 


170:N,CA,C,O,CB,OG 






55 


TIB 


:TYR 


171:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2 


,CZ 


,OH 




TIB 


:GLY 


172:N,CA,C,0 








TIB 


: ALA 


173:N,CA,C,0,CB 







;N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 
;N,CA, 
IN, CA, 
;N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 
;N,CA, 
;N,CA, 
;N,CA, 
;N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 
:N,CA, 



C,0,CB 
C , O 



C , O , CB , CG , CD , OE1 , NE2 
0G1,CG2 
CG,CD1,CD2 

CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
CG,CD,NE,CZ,NH1,NH2 
CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
OG 

CB,CG 

CG,CD,OEl,OE2 
CG,CD,CE,NZ 
OG 

CG1,CG2 
OGl,CG2 
CG,ODl,OD2 
CG1,CG2,CD1 
CGI , CG2 , CD1 
CG,CD,OE1,OE2 



r - - f 

,CB, 
_,CB, 
0,CB, 
0,CB, 
0,CB, 
>,C,0, 
0,CB, 
0,CB, 
0,CB, 
w O,CB, 
C,0,CB, 
" 0,CB, 
0,CB, 
C,0,CB, 
C,0,CB, 
C,0 
C,0 

C,0,CB, 



C,0 
C 

c 
c 

CD 
C 
C 

c 
c 



OXT, CG, CD1,CD2 
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TIB: 


PRO 


174:N,CA, CD,C,0,CB,CG 






TIB: 


ARG 


175:N,CA,C, 


0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


VAL 


176:N, CA,C, 


0,CB,CG1,CG2 






TIB: 


ILE 


196:N f CA^! 


,0,CB,CG1,CG2,CD1 




5 


TIB: 


THR 


197:N,CA,C, 


0,CB,0G1,CG2 






TIB: 


HIS 


198:N,CA,C, 


,0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB: 


THR 


199:N,CA,C, 


0,CB,0G1,CG2 






TIB: 


ASN 


200:N,CA,C, 


,0,CB,CG,0D1,ND2 






TIB: 


ASP 


201:N,CA,C, 


0,CB,CG,0D1,0D2 




10 


TIB: 


ILE 


202:N,CA,C, 


0,CB,CG1,CG2,CD1 






TIB: 


VAL 


203:N,CA,C, 


,0,CB,CG1,CG2 






TIB; 


PRO 


204:N,CA, 00,0,0, CB,CG 






TIB: 


; ARG 


205:N,CA,C, 


0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


:LEU 


206:N,CA,C, 


f 0,CB,CG,CDl,CD2 




15 


TIB: 


;TRP 










221: 


N,CA, 0,0,06,00, 


CD1 , CD2 , NE1 , CE2 , CE3 , 


CZ2, 




TIB: 


;ILE 


222:N,CA,C I 


,0,CB,CG1,CG2,CD1 






TIB: 


:THR 


226:N,CA,C, 


0,CB,0G1,CG2 






TIB j 


:GLY 


246:N,CA,C 4 


,0 




20 


TIB; 


;ASN 


247:N,CA,C< 


,0,CB,CG,0D1,ND2 






TIB; 


:ASN 


248:N,CA,C, 


,0,CB,CG,0D1,ND2 






TIB; 


: GLN 


249:N,CA,C, 


0, CB,CG,CD,OEl,NE2 






TIB; 


;ASN 


251:N,CA,C, 


,0,CB,CG,0D1,ND2 






TIB' 


;ILE 


252:N,CA,C, 


P 0,CB,CG1,CG2,CD1 




25 


TIB' 


;PRO 


253:N,CA,CD,C,0,CB,CG 






TIB; 


:ASP 


254:N,CA,C I 


0,CB,CG,0D1,0D2 






TIB; 


;ILE 


255:N,CA,C, 


,0,CB,CG1,CG2,CD1 






TIB: 


:PRO 


256:N,CA,CD,C,0,CB,CG 






TIB; 


: ALA 


257:N,CA,C 1 


,0,CB 




30 


TIB; 


:HIS 


258:N,CA,C J 


0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB; 


;LEU 


259:N,CA,C i 


,O,CB,CG,CDl,0D2 






TIB; 


:TRP 










260: 


N , C A , 0 , 0 , CB , CG , 


CD1 , CD2 , NE1 , CE2 , CE3 , 


CZ2, 




TIB' 


:TYR 


261:N,CA,C, 


,O,CB,CG,CDl,0D2,CEl, 


CE2, 


35 


TIB 


: GLY 


263:N,CA,C I 


,0 






TIB 


:LEU 


264:N,CA,C, 


,0,CB,CG f CD1,CD2 






TIB 


:ILE 


265:N,CA,C, 


,0,06,001,002,001 






TIB 


:GLY 


266:N,CA,C, 


rO 






TIB 


:THR 


267:N,CA,C J 


,0,CB,0G1,CG2 




40 


Subset RESTX: 







restxmole. list 
Subset RESTX: 

NEWMODEL: 14, 16, 18-20, 31-34, 36, 38, 40, 48-50, 56-66, 68, 78- 
79,88,91-93, 

45 NEWMODEL: 104-106, 120, 136, 225, 227-229, 250, 262, 268 
restxatom. list 
Subset RESTX: 

NEWMODEL : ALA 14:N,CA,C,0,CB 

NEWMODEL: TYR 16 :N, CA, C,0, CB, CG, CD1 , CD2 , CE1, CE2 , CZ, OH 
50 NEWMODEL: ALA 18 :N,CA,C,0,CB 

NEWMODEL : ALA 19:N,CA,C,0,CB 

NEWMODEL : ALA 20:N,CA,C,0,CB 

NEWMODEL : GLY 31:N,CA,C,0 

NEWMODEL: THR 32 :N,CA,C,0,CB,0G1,CG2 
55 NEWMODEL : ASN 3 3 : N , CA , C , O , CB , CG , 0D1 , ND2 

NEWMODEL: ILE 34 :N,CA,C,0,CB,CG1,CG2 ,CD1 

NEWMODEL : CYS 36 :N, CA, C, O, CB, SG 
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10 



15 



20 



25 



30 



35 



NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 



GLY 
ALA 
ASP 
ALA 
THR 
GLU 
ASP 
SER 
GLY 
VAL 
GLY 
ASP 
VAL 
THR 
GLY 
PHE 
ALA 
LEU 
SER 
ASN 
GLY 
ASN 
LEU 
CYS 
SER 
GLY 
VAL 
PRO 
GLY 
LEU 
VAL 
PRO 
PRO 
PHE 
CYS 



38 
40 
48 
49 
50 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
68 
78 
79 
88 
91 
92 
93 



N,CA,C 
N,CA,C 
N,CA, C 
N,CA, C 
N,CA,C 
N,CA, C 
N,CA, C 
N,CA,C 
N,CA,C 
N,CA, C 
N,CA,C 
N,CA, C 
N,CA,C 
N,CA,C 
N,CA,C 
N, CA,C 
N,CA,C 
N, CA,C 
N, CA,C 
N, CA,C 
N,CA,C 
N, CA, C 
N,CA,C 
104:N,CA, 
105:N,CA, 
106:N,CA, 
120:N,CA, 
136:N,CA, 
225:N,CA, 
227:N,CA, 
228:N,CA, 
229:N,CA, 
250:N,CA, 
262:N,CA, 
268:N / CA / 



CZ 



O 

,0,CB 

,0,05,00,001,002 
r O,CB 

f 0,CB,0Gl,CG2 
,0,CB,CG,CD,OE1,OE2 
,0,CB,CG,0D1,0D2 
,0,CB,OG 
O 

0,CB,CG1,CG2 
O 

0,CB,CG,0D1,0D2 
0,CB,CG1,CG2 
,0,CB,0G1,CG2 

,0 

, 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , 
,0,CB 

0,08,00,001,002 
CB,OG 

CB, CG, 0D1,ND2 

,,u,CB,CG,0Dl,ND2 
1,0,06,00,001,002 
C,0,CB,SG 
C,0,CB,OG 
,0,0 

,C,0,CB,CG1,CG2 
,CD,C,0,CB,CG 
,0,0 

,0,0,06,00,001,002 
r C,0,CB,CGl,CG2 
r CD, 0,0, CB, CG 
,0D,C,O,CB,CG 

, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
, C , O , CB , SG 



Example 10 

Providing a lipase variant E87K+D254K 
The Humicola lanuginosa lipase variant E87K+D254K was 
40 constructed, expressed and purified as described in WO 
92/05249, 



Example 11 

Lipase-S-PEG 15.000 conjugate 
45 The lipase variant E87K+D254K-SPEG conjugate was prepared as 
described in Example 7, except that the enzyme is the Humicola 
lanuginosa lipase variant (E87K+D254K) described in Example 10 
and the polymer is mPEG15,000. 



50 Example 12 
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Tmmunoaenecitv assessed as IqG^ of lipase variant fD87K+D254K) in 
Balb/C mice 

Balb/c mice were immunized by subcutanuous injection of: 

i) 50 ill 0.9% (wt/vol) NaCl solution (control group, 8 mice) 
5 (control), 

ii) 50jil 0.9% (wt/vol) NaCl solution containing 25 \ig of protein 
of a Humicola lanuginosa lipase variant (E87K+D254K) (group 1, 

8 mice) (unmodified lipase variant), 

iii) 50% 0.9% (wt/vol) NaCl solution containing a Humicola 

10 lanugoinosa lipase variant substituted in position D87K+D254K and 

coupled to a N-succinimidyl carbonate activated mPEG 15, 000 (group 

2, 8 mice) (lipase-SPEGIS, 000) . 

The amount of protein for each batch was measured by optical 

density measurements- Blood samples (200 were collected 

15 from the eyes one week after the immunization, but before the 

following immunization. Serum was obtained by blood clothing, 

and centrifugation. 

The IgGi response was determined by use of the Balb/C mice 

Igd ELISA method as described above. 
20 Results : 

Five weekly immunizations were required to elicit a 
detectable humoral response to the unmodified Humicola 
lanuginosa variant. The antibody titers elicited by the 
conjugate (i.e. lipase-SPEGIS, 000 ranged between 960 and 1920, 
25 and were only 2 to 4x lower than the antibody titer of 3840 
that was elicited by unmodified HL82-Lipolase (figure to the 
left) . 

The results of the tests are shown in Figure 1 

As will be apparent to those skilled in the art, in the light 
30 of the foregoing disclosure, many alterations and modifications 
are possible in the practice of this invention without departing 
from the spirit or scope thereof. Accordingly, the scope of the 
invention is to be construed in accordance with the substance 
defined by the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: Novo Nordisk A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsveard 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP): DK-2880 
10 (G) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(ii) TITLE OF INVENTION: A modified polypeptide 

(iii) NUMBER OF SEQUENCES: 9 
(iv) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus sp. PD498, NCI MB No. 40484 
(ix) FEATURE: 

3 0 (A) NAME /KEY: CDS 

(B) LOCATION:!. .840 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGG TCA CCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT GGA CCA CAA 48 
35 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

AAC ACC TCA ACC CCT GCT GCC TGG GAT GTA ACC CGT GGA AGC AGC ACT 96 
Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
40 20 25 30 

CAA ACG GTG GCG GTC CTT GAT TCC GGA GTG GAT TAT AAC CAC CCT GAT 144 
Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



45 



CTT GCA AGA AAA GTA ATA AAA GGG TAC GAC TTT ATC GAC AGG GAC AAT 192 
Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
50 55 60 



50 AAC CCA ATG GAT CTT AAC GGA CAT GGT ACC CAT GTT GCC GGT ACT GTT 240 
Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

GCT GCT GAT ACG AAC AAT GGA ATT GGC GTA GCC GGT ATG GCA CCA GAT 288 
55 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

ACG AAG ATC CTT GCC GTA CGG GTC CTT GAT GCC AAT GGA AGT GGC TCA 336 
Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
60 100 105 110 

CTT GAC AGC ATT GCC TCA GGT ATC CGC TAT GCT GCT GAT CAA GGG GCA 384 
Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 



65 



AAG GTA CTC AAC CTC TCC CTT GGT TGC GAA TGC AAC TCC ACA ACT CTT 432 
Lys Val Leu Asn. Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
130 135 140 



70 AAG AGT GCC GTC GAC TAT GCA TGG AAC AAA GGA GCT GTA GTC GTT GCT 



480 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

GCT GCA GGG AAT GAC AAT GTA TCC CGT ACA TTC CAA CCA GCT TCT TAC 528 
5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

CCT AAT GCC ATT GCA GTA GGT GCC ATT GAC TCC AAT GAT CGA AAA GCA 576 
Pro Asn Ala He Ala Val Gly Ala He Asp Ser Asn Asp Arg Lys Ala 
10 180 185 190 

TCA TTC TCC AAT TAC GGA ACG TGG GTG GAT GTC ACT GCT CCA GGT GTG 624 
Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 



15 



AAC ATA GCA TCA ACC GTT CCG AAT AAT GGC TAC TCC TAC ATG TCT GGT 672 
Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
210 215 220 



20 ACG TCC ATG GCA TCC CCT CAC GTG GCC GGT TTG GCT GCT TTG TTG GCA 720 
Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

AGT CAA GGT AAG AAT AAC GTA CAA ATC CGC CAG GCC ATT GAG CAA ACC 768 
25 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

GCC GAT AAG ATC TCT GGC ACT GGA ACA AAC TTC AAG TAT GGT AAA ATC 816 
Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
30 260 265 270 

AAC TCA AAC AAA GCT GTA AGA TAC 840 
Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 280 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

45 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
20 25 30 



50 



Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
55 50 55 60 

Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

60 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
100 105 110 

65 

Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 

Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
70 130 135 140 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

Pro Asn Ala He Ala Val Gly Ala He Asp Ser Asn Asp Arg Lys Ala 
180 185 190 

10 

Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 

Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
15 210 215 220 

Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

2 0 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
260 265 270 

25 

Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

(2) INFORMATION FOR SEQ ID NO: 3: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

40 Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

1 5 10 15 



45 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

55 Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



60 



Ser Gly Ser Gly Ser Val Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 

65 130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He Ser 

145 150 155 160 

70 Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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165 



170 



175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 



(2) INFORMATION FOR SEQ ID NO; 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Arthromyces ramosus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Gin Gly Pro Gly Gly Gly Gly Gly Ser Val Thr Cys Pro Gly Gly Gin 
1 5 10 15 

Ser Thr Ser Asn Ser Gin Cys Cys Val Trp Phe Asp Val Leu Asp Asp 
20 25 30 

Leu Gin Thr Asn Phe Tyr Gin Gly Ser Lys Cys Glu Ser Pro Val Arg 
35 40 45 

Lys He Leu Arg He Val Phe His Asp Ala He Gly Phe Ser Pro Ala 
50 55 60 

Leu Thr Ala Ala Gly Gin Phe Gly Gly Gly Gly Ala Asp Gly Ser He 
65 70 75 80 

He Ala His Ser Asn He Glu Leu Ala Phe Pro Ala Asn Gly Gly Leu 
85 90 95 

Thr Asp Thr He Glu Ala Leu Arg Ala Val Gly He Asn His Gly Val 
100 105 HO 

Ser Phe Gly Asp Leu He Gin Phe Ala Thr Ala Val Gly Met Ser Asn 
115 120 125 

Cys Pro Gly Ser Pro Arg Leu Glu Phe Leu Thr Gly Arg Ser Asn Ser 
130 135 140 

Ser Gin Pro Ser Pro Pro Ser Leu He Pro Gly Pro Gly Asn Thr Val 
145 150 155 160 

Thr Ala He Leu Asp Arg Met Gly Asp Ala Gly Phe Ser Pro Asp Glu 
165 * 170 175 

Val Val Asp Leu Leu Ala Ala His Ser Leu Ala Ser Gin Glu Gly Leu 
180 185 190 

Asn Ser Ala He Phe Arg Ser Pro Leu Asp Ser Thr Pro Gin Val Phe 
195 200 205 
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Asp Thr Gin Phe Tyr He Glu Thr Leu Leu Lys Gly Thr Thr Gin Pro 
210 215 220 

Gly Pro Ser Leu Gly Phe Ala Glu Glu Leu Ser Pro Phe Pro Gly Glu 
5 225 230 235 240 

Phe Arg Met Arg Ser Asp Ala Leu Leu Ala Arg Asp Ser Arg Thr Ala 
245 250 255 

10 Cys Arg Trp Gin Ser Met Thr Ser Ser Asn Glu Val Met Gly Gin Arg 

260 265 270 



15 



Tyr Arg Ala Ala Met Ala Lys Met Ser Val Leu Gly Phe Asp Arg Asn 
275 280 285 

Ala Leu Thr Asp Cys Ser Asp Val He Pro Ser Ala Val Ser Asn Asn 
290 295 300 



Ala Ala Pro Val He Pro Gly Gly Leu Thr Val Asp Asp He Glu Val 
20 305 310 315 320 

Ser Cys Pro Ser Glu Pro Phe Pro Glu He Ala Thr Ala Ser Gly Pro 
325 330 335 

25 Leu Pro Ser Leu Ala Pro Ala Pro 

340 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa DSM 4109 
(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION:!.. 66 
40 (ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 67. .876 
(ix) FEATURE: 

(A) NAME /KEY: CDS 
45 (B) LOCATION: 1 . . 876 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AGG AGC TCC CTT GTG CTG TTC TTT GTC TCT GCG TGG ACG GCC TTG 48 
Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser- Ala Trp Thr Ala Leu 
50 -22 -20 -15 -10 

GCC AGT CCT ATT CGT OGA GAG GTC TCG CAG GAT CTG TTT AAC CAG TTC 96 

Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 1 5 10 

55 

AAT CTC TTT GCA CAG TAT TCT GCA GCC GCA TAC TGC GGA AAA AAC AAT 144 

Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 

60 GAT GCC CCA GCT GGT ACA AAC ATT ACG TGC ACG GGA AAT GCC TGC CCC 192 
Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

GAG GTA GAG AAG GCG GAT GCA ACG TTT CTC TAC TCG TTT GAA GAC TCT 240 
65 Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
45 50 55 

GGA GTG GGC GAT GTC ACC GGC TTC CTT GCT CTC GAC AAC ACG AAC AAA 288 
Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
70 60 65 70 
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TTG ATC GTC CTC TCT TTC CGT GGC TCT CGT TCC ATA GAG AAC TGG ATC 336 
Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

5 

GGG AAT CTT AAC TTC GAC TTG AAA GAA ATA AAT GAC ATT TGC TCC GGC 384 
Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

10 TGC AGG GGA CAT GAC GGC TTC ACT TCG TCC TGG AGG TCT GTA GCC GAT 432 
Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

ACG TTA AGG CAG AAG GTG GAG GAT GCT GTG AGG GAG CAT CCC GAC TAT 480 
15 Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
125 * 130 135 

CGC GTG GTG TTT ACC GGA CAT AGC TTG GGT GGT GCA TTG GCA ACT GTT 528 
Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
20 140 145 150 

GCC GGA GCA GAC CTG CGT GGA AAT GGG TAT GAT ATC GAC GTG TTT TCA 576 
Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 



25 



TAT GGC GCC CCC CGA GTC GGA AAC AGG GCT TTT GCA GAA TTC CTG ACC 624 
Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 



30 GTA CAG- ACC GGC GGA ACA CTC TAC CGC ATT ACC CAC ACC AAT GAT ATT 672 
Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

GTC CCT AGA CTC CCG CCG CGC GAA TTC GGT TAC AGC CAT TCT AGC CCA 720 
35 Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
205 210 215 

GAG TAC TGG ATC AAA TCT GGA ACC CTT GTC CCC GTC ACC CGA AAC GAT 768 
Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
40 220 225 230 

ATC GTG AAG ATA GAA GGC ATC GAT GCC ACC GGC GGC AAT AAC CAG CCT 816 
He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 



45 



AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG 864 
Asn lie Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 



50 ACA TGT CTT TAG 876 
Thr Cys Leu * 
270 

(2) INFORMATION FOR SEQ ID NO: 6: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
-22 -20 -15 -10 

65 Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 1 5 10 



70 



Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 
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Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
5 45 50 55 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
60 65 70 

10 Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

15 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
20 125 130 135 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
140 145 150 

25 Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 

30 

Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
35 205 210 215 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
220 225 230 

40 He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 

Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 

45 

Thr Cys Leu * 
270 

50 (2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R28K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

60 gggatgtaac caagggaagc agcactcaaa eg 32 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "R62K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
5 cgactttatc gataaggaca ataaccc 27 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R169K oligo" 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



caatgtatcc aaaacgttcc aaccagc 



27 
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Patent claims 

1. A polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
5 polypeptide, having been modified in a manner to increase the 

number of attachment groups on the surface of the polypeptide, in 
comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/or 

b) one or more fewer polymeric molecules coupled to the 
10 polypeptide, having been modified in a manner to decrease the 

number of attachment groups at or close to the functional site(s) 
of the polypeptide, in comparison to the number of attachment 
groups available on the corresponding parent polypeptide, 

2. The conjugate according to claims 1, having 1 to 25, 
15 preferably 1 to 10 additional polymeric molecules coupled to the 

surface of the polypeptide in comparison to the number of 
polymeric molecules of a conjugate prepared from the corresponding 
parent enzyme. 

3. The conjugate according to claims 1 and 2, wherein the 
20 additional attachment group(s) is (are) amino groups in the form of 

Lysine residues (s), or carboxylic groups in the form of Aspartic 
acid or Glutamic acid residues. 

4. The conjugate according to any of claims 1 to 3, wherein 
the additional attachment group (s) is (are) prepared by a 

25 conservative substitution of an amino acid residue, such as an 
Arginine to Lysine substitution. 

5. The conjugate according to claims 1 to 3, wherein the 
additional attachment group (s) is (are) prepared by a conservative 
substitution of an amino acid, such as an Aspargine to 

30 Aspartate/ Glutamate or a Glutamine to Aspartate/Glutamate 
substitution. 

6. The conjugate according to any of claims 1 to 5, wherein 
the added attachment group is located more than 5 A, preferably 8 
A, especially 10 A from the functional site. 

35 7. The conjugate according to claim 1, having 1 to 25 

preferably 1 to 10 fewer polymeric molecules coupled at or close 
to the functional site of the polypeptide in comparison to the 
number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 
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8. The conjugate according to claim 7, wherein the removed 
attachment group (s) is (are) amino groups in the form of Lysine 
residues (s) , or carboxylic groups in the form of Aspartic acid or 
Glutamic acid residues. 
5 9. The conjugate according to any of claims 7 and 8, wherein 

the removed attachment group (s) is (are) prepared by a conservative 
substitution of an amino group, such as Lysine to Arginine 
substitution. 

10. The conjugate according to any of claims 7 to 8, wherein 
10 the removed attachment group (s) is (are) prepared by a conservative 

substitution of a carboxylic group, such as an Aspartate/Glutamate 
to Aspargine or Aspartate/Glutamate to a Glutamine substitution. 

11. The conjugate according to any of claims 1 to 10, wherein 
the removed attachment group is located within 5 A, preferably 8 

15 A, especially 10 A from the functional site. 

12. The conjugate according to any of claims 1 to 11, wherein 
the attachment groups are broadly spread. 

13. The conjugates according to claims 1 to 12, wherein the 
parent polypeptide moiety of the conjugate has a molecular weight 

20 from 1 to 100 kDa, preferred 15 to 100 kDa. 

14. The conjugate according to claim 13, wherein the parent 
polypeptide moiety of the conjugate has a molecular weight of from 
1 to 35 kDa. 

15. The conjugates according to claim 14, wherein the parent 
25 polypeptide is an enzyme selected from the group of 
Oxidoreductases, including laccases and Superoxide disrautase 
(SOD); Hydrolases, including proteases, especially subtilisins, 
and lipolytic enzymes; Transferases, including Transglutaminases 
(TGases) ; Isomerases, including Protein disulfide Isomerases 
30 (PDI) . 

16. The conjugate according to claim 15, wherein the parent 
enzyme is PD498, Savinase®, BPN" , Proteinase K, Proteinase R, 
Subtilisin DY, Lion Y, Rennilase®, JA16, Alcalase® or a Humicola 
lanuginosa lipase, such as Lipolase®. 
35 17. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a PD498 variant with one or more of the 
following substitutions: R51K, R62K, R121K, R169K, R250K, R28K, 
R190K, P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, 
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G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, S220K, 
Y221K, G262K. 

18. The conjugate according to claim 17 , with one of the 
following mutations: R28K+R62K, R28K+R169K, R62K + R169K, 
5 R28K+R69K+R169K. 

19. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Savinase® variant with one or more of 
the following substitutions: R10K, R19K, R45K, R145K, R170K, 
R186K, R247K, K94R, P5K, P14K, T22K, T38K, H39K, P40K, L42K, 

10 L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K, 
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K, 
T213K, A215K, S216K, N269K. 

20. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Humicola lanuginosa lipase variant 

15 with one or more of the following substitutions: 

R133K,R139K,R160K,R179K,R209K,R118K,R125K,A18K,G31K,T32K, 
N33K,G38K,A40K / D48K,T50K / E56K f D57K,S58K,G59K # V60K f G61K / D62K / 
T64K / L78K r E87K f N88K,G91K / N92K,L93K / S105K / G106K f V120K # P136K,G225 
K / L227K,V228K,P229K,P250K,D254K,F262K. 

20 21. The conjugate according to claim 20 with the following 

mutations E87K+D254K. 

22. The conjugate according to any of claims 1 to 21, wherein 
the polymeric molecules coupled to the polypeptide have a 
molecular weight from 1 to 60 kDa, especially 1-35 kDa, especially 

25 3 to 25 kDa. 

23. The conjugate according to claim 22 , wherein the poly- 
meric molecule is selected from the group comprising a natural or 
synthetic homo- and heteropolymers, selected from the group of the 
synthetic polymeric molecules including Branched PEGs, poly-vinyl 

30 alcohol (PVA) , poly-carboxyl acids, poly-(vinylpyrolidone) and 
poly-D,L-amino acids, or natural occurring polymeric molecules 
including dextrans, including carboxymethyl-dextrans, and 
celluloses such as methylcellulose, carboxymethylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, and 

35 hydrolysates of chitosan, starches, such as hydroxyethyl -starches, 
hydroxypropyl-starches, glycogen, agarose, guar gum, inulin, 
pullulans, xanthan gums, carrageenin, pectin and alginic acid. 

24. A method for preparing improved polypeptide-polymer 
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conjugates comprising the steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
5 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
10 selected in step b) at or close to the functional site, 

d) coupling polymeric molecules to the mutated polypeptide. 

25. The method according to claim 24, wherein the 
identification of amino acid residues located on the surface on 
the polypeptide referred to in step a) are performed by a computer 

15 program analyzing the 3D structure of the parent polypeptide in 
question. 

26. The method according to claim 24, wherein step b) 
comprises selecting Arginine or Lysine residues on the surface of 
the parent polypeptide. 

20 27. The method according to claim 24, wherein one or more 

Arginine residues identified in step b) is (are) substituted with a 
Lysine residue (s) in step c) . 

28. The method according to claims 27, wherein the 
substituted Arginine residues have a distance of more than 5 A, 

25 preferably 8 A , especially 10 A from the functional site. 

29. The method according to any of claims 24 to 28, wherein 
the polypeptide prepared in step . c) is coupled to polymeric 
molecules . 

30. Use of the conjugate in claims 1 to 23 for reducing the 
30 allergenicity of industrial products. 

31. Use of the conjugate in claims 1 to 23 for reducing the 
immunogenic ity of pharmaceuticals. 

32. A composition comprising a conjugate of any of claims 1 
to 23 and further comprising ingredients used in industrial 

35 products. 

33. The composition according to claim 32, wherein the 
industrial product is a detergent, such as a laundry, dish wash or 
hard surface cleaning product, or a food or feed product. 
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34. The composition according to claim 32, comprising a 
conjugate of any of claims 1 to 22 and further ingredients used in 
skin care products. 

35. A composition comprising a conjugate of any of claims 1 
5 to 23 and further comprising ingredients used in pharmaceuticals. 
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